This is a dataset created by crawling high Karma (upvote count) posts' external webpage links from RedditWebText 모방으로 reddit 기반이라 퀄리티 그렇게 좋지는 않다. Skylion007/openwebtext · Datasets at Hugging FaceWe’re on a journey to advance and democratize artificial intelligence through open source and open science.https://huggingface.co/datasets/openwebtextstas/openwebtext-10k · Datasets at Hugging FaceWe’re on a journey to advance and democratize artificial intelligence through open source and open science.https://huggingface.co/datasets/stas/openwebtext-10k