Evalverse: Revolutionizing Large Language Model Evaluation with a Unified, User-Friendly Framework
Discover Evalverse: a unified, user-friendly framework that simplifies Large Language Model evaluation and makes comprehensive assessment accessible to everyone. Explore its key features and architecture, and see it in action in our demonstration video. Join us in driving innovation and accessibility in AI technology with Evalverse!
Dataverse: An Open-Source Project All About Data Processing
Dataverse is an open-source project designed to streamline the extract, transform, load (ETL) pipeline in Python. In this post, we delve into the origins of the project and its future prospects in the realm of open-source data processing.
(Almost) Zero Hallucination with RAG and Groundedness Check
How can retrieval-augmented generation (RAG) combined with a groundedness check bring LLM hallucinations close to zero? We walk through the approach in this post.
LLM Evaluation Part2. Mechanics Behind LLM Scoring Systems
Unraveling the mechanics behind translating LLM outputs into leaderboard scores. Exploring a spectrum of evaluation mechanisms that span from automated metrics to human assessments rooted in real-world applicability.
LLM Evaluation Part1. What is a Benchmark Dataset?
Want to know why and how we evaluate LLMs?
Reinterpreting the History of NLP-based AI through a Data-Centric Perspective
What insights can be gained by examining natural language processing (NLP) through a data-centric perspective? Explore our blog post that delves into the 70-year history of AI, covering rule-based systems, machine learning, deep learning, and the recent emergence of large language models.