data-dumper-weka-package
Weka package for listening in on data as it passes through filter pipelines (i.e., weka.filters.MultiFilter), classifiers, or clusterers.
Available schemes
- Dumper schemes
  - weka.core.dump.Null -- dummy, does nothing
  - weka.core.dump.InMemory -- not for GUI use, but API use, as it notifies registered listeners whenever the data changes
  - weka.core.dump.SaveToDisk -- stores the data in the specified file on disk (auto-detects file format based on extension); allows appending a timestamp to output separate files during cross-validation
- Integration
  - weka.classifiers.meta.Dumper
  - weka.clusterers.Dumper
  - weka.filters.Dumper
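To illustrate what "notifies registered listeners whenever the data changes" means for weka.core.dump.InMemory, here is a minimal, self-contained sketch of the listener idea. It does not use the package's classes: `InMemoryDumperSketch`, `DataListener`, `addListener`, `removeListener` and `dump` are hypothetical names chosen for illustration, and plain strings stand in for a Weka dataset. Consult the package's source for the actual API.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

/** Hypothetical stand-in for an in-memory dumper; NOT the package's actual API. */
final class InMemoryDumperSketch {

    /** Hypothetical callback interface; the rows stand in for a Weka dataset. */
    interface DataListener {
        void dataChanged(List<String> rows);
    }

    // Thread-safe list, so listeners can be added or removed while data is being dumped.
    private final List<DataListener> listeners = new CopyOnWriteArrayList<>();

    void addListener(DataListener listener) {
        listeners.add(listener);
    }

    void removeListener(DataListener listener) {
        listeners.remove(listener);
    }

    /** Called whenever data passes through; each listener receives its own copy. */
    void dump(List<String> rows) {
        for (DataListener listener : listeners) {
            listener.dataChanged(new ArrayList<>(rows));
        }
    }

    public static void main(String[] args) {
        InMemoryDumperSketch dumper = new InMemoryDumperSketch();
        dumper.addListener(rows -> System.out.println("received " + rows.size() + " rows"));
        dumper.dump(Arrays.asList("1.0,2.0,yes", "3.0,4.0,no"));
    }
}
```

Handing each listener a copy keeps one listener from modifying the data that another listener sees.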
Example usage:
When used in conjunction with the weka.filters.MultiFilter, it is possible to output the data at any given stage in the filter pipeline by inserting weka.filters.Dumper instances where required. Here is an example layout:
- weka.filters.MultiFilter
|
+ weka.filters.Dumper -dumper "weka.core.dump.SaveToDisk -output-file ./1-initial.arff"
|
+ weka.filters.unsupervised.attribute.AddNoise
|
+ weka.filters.Dumper -dumper "weka.core.dump.SaveToDisk -output-file ./2-with_noise.arff"
|
+ weka.filters.unsupervised.attribute.Normalize
|
+ weka.filters.Dumper -dumper "weka.core.dump.SaveToDisk -output-file ./3-normalized.arff"
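The layout above can also be written as a single MultiFilter command line, where each sub-filter is passed through MultiFilter's `-F` option and nested quotes are escaped with a backslash. The following is a rough sketch that only assembles that string using the standard library. The class `DumperPipelineCommand` and its helper methods are hypothetical, and the quoting is a simplified version of what Weka does internally, which should be enough for plain file names like the ones used here.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative helper (hypothetical): assembles a MultiFilter command line with Dumper stages. */
final class DumperPipelineCommand {

    /** Simplified quoting (an assumption): escape backslashes and double quotes, then wrap in quotes. */
    static String quote(String s) {
        return "\"" + s.replace("\\", "\\\\").replace("\"", "\\\"") + "\"";
    }

    /** Command line of a Dumper filter that saves the data to the given file. */
    static String dumper(String outputFile) {
        return "weka.filters.Dumper -dumper "
            + quote("weka.core.dump.SaveToDisk -output-file " + outputFile);
    }

    /** Joins the filter specifications into one MultiFilter command line, one -F per filter. */
    static String multiFilter(List<String> filterSpecs) {
        StringBuilder sb = new StringBuilder("weka.filters.MultiFilter");
        for (String spec : filterSpecs) {
            sb.append(" -F ").append(quote(spec));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<String> specs = new ArrayList<>();
        specs.add(dumper("./1-initial.arff"));
        specs.add("weka.filters.unsupervised.attribute.AddNoise");
        specs.add(dumper("./2-with_noise.arff"));
        specs.add("weka.filters.unsupervised.attribute.Normalize");
        specs.add(dumper("./3-normalized.arff"));
        // Prints the complete command line for the pipeline.
        System.out.println(multiFilter(specs));
    }
}
```

The printed string should be usable wherever Weka accepts a filter specification, provided the package is installed. It is still worth checking the quoting against your Weka version.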
How to use packages
For more information on how to install the package, see:
https://waikato.github.io/weka-wiki/packages/manager/
Maven
Use the following dependency in your pom.xml:
<dependency>
  <groupId>com.github.fracpete</groupId>
  <artifactId>data-dumper-weka-package</artifactId>
  <version>2019.9.13</version>
  <type>jar</type>
  <exclusions>
    <exclusion>
      <groupId>nz.ac.waikato.cms.weka</groupId>
      <artifactId>weka-dev</artifactId>
    </exclusion>
  </exclusions>
</dependency>