Skip to content

Class: Dataset

Base dataset container (dspy.Dataset).

URI: dspy:Dataset

 classDiagram
    class Dataset
    click Dataset href "../Dataset/"
      NamedThing <|-- Dataset
        click NamedThing href "../NamedThing/"


      Dataset <|-- Colors
        click Colors href "../Colors/"
      Dataset <|-- HotPotQA
        click HotPotQA href "../HotPotQA/"
      Dataset <|-- MATH
        click MATH href "../MATH/"
      Dataset <|-- AlfWorld
        click AlfWorld href "../AlfWorld/"
      Dataset <|-- GSM8K
        click GSM8K href "../GSM8K/"


      Dataset : description

      Dataset : devset





        Dataset --> "*" Example : devset
        click Example href "../Example/"



      Dataset : id

      Dataset : name

      Dataset : splits





        Dataset --> "0..1" Any : splits
        click Any href "../Any/"



      Dataset : testset





        Dataset --> "*" Example : testset
        click Example href "../Example/"



      Dataset : trainset





        Dataset --> "*" Example : trainset
        click Example href "../Example/"



Inheritance

Slots

Name Cardinality and Range Description Inheritance
trainset *
Example
Training split used by an optimizer direct
devset *
Example
Dev / evaluation split direct
testset *
Example
Held-out test split direct
splits 0..1
Any
Dataset split sizes / metadata direct
id 1
Identifier
Stable opaque identifier for an instance NamedThing
name 0..1
String
Short human-readable name NamedThing
description 0..1
String
Free-form description NamedThing

In Subsets

Identifier and Mapping Information

Annotations

property value
dspy_source_paths ['dspy/datasets/dataset.py']
dspy_source_urls ['https://github.com/stanfordnlp/dspy/blob/main/dspy/datasets/dataset.py']

Schema Source

Mappings

Mapping Type Mapped Value
self dspy:Dataset
native dspy:Dataset
exact nexus:Dataset
related ucs_core:InformationContentEntity

LinkML Source

Direct

name: Dataset
annotations:
  dspy_source_paths:
    tag: dspy_source_paths
    value:
    - dspy/datasets/dataset.py
  dspy_source_urls:
    tag: dspy_source_urls
    value:
    - https://github.com/stanfordnlp/dspy/blob/main/dspy/datasets/dataset.py
description: Base dataset container (``dspy.Dataset``).
in_subset:
- datasets
from_schema: https://w3id.org/lmodel/dspy
exact_mappings:
- nexus:Dataset
related_mappings:
- ucs_core:InformationContentEntity
is_a: NamedThing
slots:
- trainset
- devset
- testset
- splits

Induced

name: Dataset
annotations:
  dspy_source_paths:
    tag: dspy_source_paths
    value:
    - dspy/datasets/dataset.py
  dspy_source_urls:
    tag: dspy_source_urls
    value:
    - https://github.com/stanfordnlp/dspy/blob/main/dspy/datasets/dataset.py
description: Base dataset container (``dspy.Dataset``).
in_subset:
- datasets
from_schema: https://w3id.org/lmodel/dspy
exact_mappings:
- nexus:Dataset
related_mappings:
- ucs_core:InformationContentEntity
is_a: NamedThing
attributes:
  trainset:
    name: trainset
    description: Training split used by an optimizer.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    owner: Dataset
    domain_of:
    - KNN
    - Dataset
    range: Example
    multivalued: true
    inlined: true
    inlined_as_list: true
  devset:
    name: devset
    description: Dev / evaluation split.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    owner: Dataset
    domain_of:
    - Evaluate
    - Dataset
    range: Example
    multivalued: true
    inlined: true
    inlined_as_list: true
  testset:
    name: testset
    description: Held-out test split.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    owner: Dataset
    domain_of:
    - Dataset
    range: Example
    multivalued: true
    inlined: true
    inlined_as_list: true
  splits:
    name: splits
    description: Dataset split sizes / metadata.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    owner: Dataset
    domain_of:
    - Dataset
    range: Any
  id:
    name: id
    description: Stable opaque identifier for an instance.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    slot_uri: schema:identifier
    identifier: true
    owner: Dataset
    domain_of:
    - NamedThing
    range: Identifier
    required: true
  name:
    name: name
    description: Short human-readable name.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    slot_uri: rdfs:label
    owner: Dataset
    domain_of:
    - NamedThing
    - LMToolCallPart
    - LMMessage
    - LMToolSpec
    range: string
  description:
    name: description
    description: Free-form description.
    from_schema: https://w3id.org/lmodel/dspy
    rank: 1000
    slot_uri: dct:description
    owner: Dataset
    domain_of:
    - NamedThing
    - LMToolSpec
    range: string