Arena
Ko Chatbot Arena Leaderboard - a Hugging Face Space by instructkr
Discover amazing ML apps made by the community
https://huggingface.co/spaces/instructkr/ko-chatbot-arena-leaderboard
Leaderboard
LogicKor | 한국어 언어모델 다분야 사고력 벤치마크
LogicKor은 한국어 언어모델의 다분야 사고력을 측정하는 벤치마크입니다. 추론, 수학, 글쓰기, 코딩, 이해, 문법 등 다양한 분야의 사고력을 측정합니다.
https://lk.instruct.kr/
Benchmark
HAERAE-HUB/KMMLU · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/HAERAE-HUB/KMMLU
Collection of 8 tasks to evaluate natural language understanding capability of Korean language models
klue · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/klue
Comment Dataset
hate dataset
kor_hate · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/kor_hate/viewer/default/train?f[contain_gender_bias][value]=1
jeanlee/kmhas_korean_hate_speech · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/jeanlee/kmhas_korean_hate_speech/viewer/default/train?p=1
SJ-Donald/kor-hate-sentence · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/SJ-Donald/kor-hate-sentence/viewer/default/train
jason9693/APEACH · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/jason9693/APEACH
long
Bingsu/KcBERT_Pre-Training_Corpus · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/Bingsu/KcBERT_Pre-Training_Corpus
조현병
Mental_illness/Word2Vec modeling.ipynb at master · chiheon/Mental_illness
자기주도연구2 - 온라인 커뮤니티 내에서 Mental illness 측정을 위한 모델 및 시각화 개발 - chiheon/Mental_illness
https://github.com/chiheon/Mental_illness/blob/master/Word2Vec%20modeling.ipynb
Chat data
AI-Hub
※샘플데이터는 데이터의 이해를 돕기 위해 별도로 가공하여 제공하는 정보로써 원본 데이터와 차이가 있을 수 있으며, 데이터에 따라서 민감한 정보는 일부 마스킹(*) 처리가 되어 있을 수 있습니다.
https://aihub.or.kr/aihubdata/data/view.do?currMenu=&topMenu=&aihubDataSe=data&dataSetSn=71630
AI-Hub
※샘플데이터는 데이터의 이해를 돕기 위해 별도로 가공하여 제공하는 정보로써 원본 데이터와 차이가 있을 수 있으며, 데이터에 따라서 민감한 정보는 일부 마스킹(*) 처리가 되어 있을 수 있습니다.
https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=114
Instruction data
kyujinpy/KOpen-platypus · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/kyujinpy/KOpen-platypus
kyujinpy/KoCoT_2000 · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/kyujinpy/KoCoT_2000
nlpai-lab/databricks-dolly-15k-ko · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/nlpai-lab/databricks-dolly-15k-ko?row=2
kyujinpy/OpenOrca-KO · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/kyujinpy/OpenOrca-KO
heegyu/open-korean-instructions · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/heegyu/open-korean-instructions?row=1
Wiki Dataset
지식 학습에 좋다
lcw99/wikipedia-korean-20221001 · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/lcw99/wikipedia-korean-20221001
heegyu/namuwiki-extracted · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/heegyu/namuwiki-extracted
Token Dataset
kor_3i4k · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/kor_3i4k/viewer/default/train?p=2
Just search
AIHUB
Hugging Face – The AI community building the future.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets?sort=trending&search=kor_
Hugging Face – The AI community building the future.
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets?sort=likes&search=kor_
DPO Dataset
maywell/ko_Ultrafeedback_binarized · Datasets at Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
https://huggingface.co/datasets/maywell/ko_Ultrafeedback_binarized

Seonglae Cho