You are looking at historical revision 21803 of this page. It may differ significantly from its current revision.

Dataset Utilities

A set of routines to load and manage datasets for machine learning / data mining tasks.

Exported Procedures

Creating datasets

Managing datasets

[procedure] (attribute-name attribute)

Returns the name of given attribute.

[procedure] (attribute-definition attribute)

Returns a definition of the type of given attribute.

[procedure] (relation-name relation)

Returns the name of given relation.

[procedure] (relation-attributes relation)

Returns a list of attributes for given relation.

[procedure] (relation-data relation)

Returns a list of the instances in the given relation.

[procedure] (get-attribute-values relation attribute-name)

Returns the values taken by instances in relation for given attribute name.

[procedure] (entropy relation attribute-name)

Computes entropy of given relation, using attribute-name to divide the relation into groups. attribute-name should be a nominal attribute.

[procedure] (filter-instances relation attribute-name value)

Returns a new relation containing those instances of relation which have the given value for attribute-name.

[procedure] (find-attribute-index relation attribute-name)

Returns the index number of given attribute name in relation.

[procedure] (split-instances relation attribute-name)

Given a nominal attribute, returns a list of relations, each representing instances in relation with the same value for given attribute-name.

Importing Data

[procedure] (read-arff filename)

Reads an ARFF definition from given filename, and returns a relation.

Author

Peter Lane.

License

GPL version 3.0.

Version History

in progress.