TaWeka (Taverna-to-Weka) is a Java application that connects data gathering in Taverna with data mining using Weka. The aim is to speed up the creation and comparison of biological classifiers, while simplifying sharing and reuse.
TaWeka:
TaWeka v 0.1 uses SQL queries as user specifications (step 2 above). In a benchmark abstracting five data mining scenarios, v 0.1 hits trouble when moving from simple classifiers to genomic learning. To solve this, I'm now working on a new TaWeka incorporating a lightweight semantic layer to better support the data collection narrative, which is where researchers spend most of their time.