Dataset

Creator
Creator
Seonglae ChoSeonglae Cho
Created
Created
2021 Mar 27 13:16
Editor
Edited
Edited
2025 Jun 17 10:29

Dataset for AI are three types

  • Problems with solution -
    SFT

Dataset for AI are three types

  • Problems with solution -
    SFT
We typically say that a dataset is high-dimensional if the number of data points N is smaller than the dimensionality D
  • not cheatable
  • large degree of intra-class variability
Datasets
 
 
Dataset Usages
 
 
 
I Built an AI Chatbot Based On My Favorite Podcast
Sponsored By: Reflect In the future, any time you look up information you're going to use a chatbot. This applies to every piece of information you interact with day to day: personal, organizational, and cultural.
I Built an AI Chatbot Based On My Favorite Podcast
Streamlit
20 Open Datasets for Natural Language Processing
Natural language processing is a significant part of machine learning use cases, but it requires a lot of data and some deftly handled…
20 Open Datasets for Natural Language Processing
Andrej Karpathy on Twitter / X
We have to take the LLMs to school.When you open any textbook, you'll see three major types of information:1. Background information / exposition. The meat of the textbook that explains concepts. As you attend over it, your brain is training on that data. This is equivalent… pic.twitter.com/m9vJj4AjV8— Andrej Karpathy (@karpathy) January 30, 2025
Andrej Karpathy on Twitter / X
 
 

Recommendations