Different Types of Data in AI Models
AI models can work with many types of data. Some common types include:
labeled, unlabeled,
tabular, time-series,
image, text,
structured, and unstructured data.
Each type is used for different AI problems and models.
π 1. Labeled Dataβ
What is it?
Labeled data means each data point has an answer or tag.
Example:
- An image with the label βcatβ.
- A sentence labeled as βpositiveβ sentiment.
- A medical record labeled as βdiabetes: yesβ.
Used in:
Supervised learning (where the model learns from correct answers).
β 2. Unlabeled Dataβ
What is it?
Unlabeled data has no tags or answers β just raw input.
Example:
- A folder of photos with no description.
- Web pages with no categories.
Used in:
Unsupervised learning (the model finds patterns by itself), or semi-supervised learning.
π 3. Tabular Dataβ
What is it?
Data in rows and columns β like spreadsheets or tables.
Example:
- Excel files.
- Customer databases.
- Financial records.
Used in:
Structured prediction, classification, regression (e.g., predicting house prices).
π 4. Time-Series Dataβ
What is it?
Data collected over time in a sequence.
Example:
- Stock prices every minute.
- Heart rate every second.
- Temperature every hour.
Used in:
Forecasting, anomaly detection, trend analysis.
πΌοΈ 5. Image Dataβ
What is it?
Pictures or visual information.
Example:
- Photos from a camera.
- Medical scans (X-rays, MRIs).
- Satellite images.
Used in:
Computer vision β for tasks like object detection, facial recognition.
π 6. Text Dataβ
What is it?
Words, sentences, or paragraphs β usually in human language.
Example:
- Emails.
- Chat messages.
- News articles.
Used in:
Natural Language Processing (NLP) β like sentiment analysis, translation, and chatbots.
π§© 7. Structured Dataβ
What is it?
Organized data in a fixed format β easy to store and search.
Example:
- SQL databases.
- CSV files.
Used in:
Traditional ML models like decision trees, logistic regression.
π 8. Unstructured Dataβ
What is it?
Data without a predefined format.
Example:
- Images.
- Videos.
- Audio.
- Free-form text.
Used in:
Deep learning, especially with CNNs (for images), RNNs/Transformers (for text).
Summary Tableβ
Type | Description | Example | Common Use Cases |
---|---|---|---|
Labeled Data | Data with correct answers | Image labeled βcatβ | Supervised learning |
Unlabeled Data | Data without labels | Random photos | Unsupervised/semi-supervised |
Tabular Data | Rows and columns | Sales spreadsheet | Prediction, regression |
Time-Series Data | Data over time | Heart rate logs | Forecasting, monitoring |
Image Data | Visual data | X-ray scans | Classification, detection |
Text Data | Written or spoken words | Chat messages | NLP, sentiment analysis |
Structured Data | Organized with format | Database records | Traditional ML |
Unstructured Data | No fixed format | Images, videos, articles | Deep learning, GenAI |