UCI Machine Learning Repository: Iris
Iris - UCI Machine Learning Repository
Donated on: 6/30/1988
A small classic dataset from Fisher, 1936. One of the earliest known datasets used for evaluating classification methods.
Dataset Characteristics
Type: Tabular
Subject Area: Biology
Associated Tasks: Classification
Feature Type: Real
Number of Instances: 150
Number of Features: 4
Dataset Information
What do the instances in this dataset represent?
Each instance is a plant.
Additional Information:
This is one of the earliest datasets used in the literature on classification methods and widely used in statistics and machine learning.
The dataset contains 3 classes of 50 instances each, where each class refers to a type of iris plant.
One class is linearly separable from the other 2; the latter are not linearly separable from each other.
Predicted attribute: class of iris plant.
This is an exceedingly simple domain.
Note: This data differs from the data presented in Fisher's article (as identified by Steve Chadwick). There are corrections in the 35th and 38th samples.
Has Missing Values? No
Introductory Paper
The Iris data set: In search of the source of virginica (Unwin & Kleinman, 2021)
Published in Significance, 2021
Variables Table
Variable Name | Role | Type | Description | Units | Missing Values |
|---|---|---|---|---|---|
sepal length | Feature | Continuous | cm | no | |
sepal width | Feature | Continuous | cm | no | |
petal length | Feature | Continuous | cm | no | |
petal width | Feature | Continuous | cm | no | |
class | Target | Categorical | Iris Setosa, Iris Versicolour, Iris Virginica | - | no |
Dataset Files
File | Size |
|---|---|
bezdekIris.data | 4.4 KB |
iris.data | 4.4 KB |
iris.names | 2.9 KB |
Index | 105 B |
Citation
Fisher, R. (1936). Iris [Dataset]. UCI Machine Learning Repository. https://doi.org/10.24432/C56C76
License
This dataset is licensed under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
Keywords
ecology
Creator
R. A. Fisher

