Weka+For+Newbies

This page has been filled mostly thanks to answers on the wekalist (http://list.waikato.ac.nz/mailman/listinfo/wekalist). So thanks to the participants (in particular Mark, Eibe, Tom & jmgomezhidalgo)! For any addition to this page, you can either create an account & edit it yourself or send an email to gm [AT] presans [DOT] com =Important Tips= = = =Java programming helps=
 * Since many people use Weka, lots of (basic & advanced) questions have already been asked on the mailing list. Therefore, using " wekalist" in your preferred search engine might help you get an answer faster than asking the same question again on the list **before** doing any research on your own first.
 * In case you would be **really lazy**: http://www.google.com/search?hl=en&q= %20wekalist
 * http://mindprod.com/jgloss/jcheat.html
 * http://mindprod.com/jgloss/jgloss.html
 * ==In French==
 * http://www.jmdoudoux.fr/accueil_java.htm

=Artificial Intelligence and Machine Learning Courses= =Machine Learning Intro Books= = = =Weka Reference Book= = = =Weka Intro & General Documentation= = = =Weka Development= = = =Weka Input & Output= = = =Weka Server Usage and Development=
 * Artificial Intelligence
 * https://www.udacity.com/wiki/cs271
 * Machine Learning
 * https://www.coursera.org/course/ml
 * http://www.cs.cornell.edu/Courses/cs4780/2013fa/#lectures
 * http://shop.oreilly.com/product/0636920025610.do
 * (Neural Networks) https://www.coursera.org/course/neuralnets
 * Specialized
 * (Probabilistic Graphical Models - PGM: Bayesian and Markov networks) https://www.coursera.org/course/pgm
 * (Social Network Analysis - SNA) https://www.coursera.org/course/sna
 * (Natural Language Processing - NLP)
 * https://www.coursera.org/course/nlp
 * https://www.coursera.org/course/nlangp
 * Weka
 * []
 * https://www.youtube.com/user/rushdishams
 * http://www.amazon.com/Machine-Learning-Tom-M-Mitchell/dp/0070428077/ref=sr_1_1?ie=UTF8&qid=1394186512&sr=8-1&keywords=tom+mitchell
 * http://www.amazon.com/Pattern-Classification-Pt-1-Richard-Duda/dp/0471056693/ref=pd_sim_b_5?ie=UTF8&refRID=1Z8Y81J1WHER2HDRYGP3
 * http://www.amazon.com/Artificial-Intelligence-Modern-Approach-Edition/dp/0136042597/ref=pd_sim_b_6?ie=UTF8&refRID=1Z8Y81J1WHER2HDRYGP3
 * http://www.amazon.com/Introduction-Machine-Learning-Adaptive-Computation/dp/026201243X/ref=sr_1_1?s=books&ie=UTF8&qid=1394186894&sr=1-1&keywords=Alpaydin-Introduction+to+Machine+Learning
 * http://www.amazon.com/Genetic-Programming-Computers-Selection-Adaptive/dp/0262111705/ref=sr_1_1?s=books&ie=UTF8&qid=1394186950&sr=1-1&keywords=Koza+Genetic_Programming_On_the_Programming_of_Computers_by_Means_of_Natural_Selection_Complex_Adaptive_Systems
 * http://www.amazon.com/Machine-Learning-Hackers-Drew-Conway/dp/1449303714
 * http://www.amazon.com/Mining-Social-Web-Facebook-LinkedIn/dp/1449367615/ref=pd_sim_b_6?ie=UTF8&refRID=0J9TP17C75CRAY0ETH2T
 * http://www.amazon.com/Machine-Learning-Email-Filtering-Priority/dp/1449314309/ref=sr_1_2?ie=UTF8&qid=1394186854&sr=8-2&keywords=Machine+Learning+Email+Filtering
 * http://www.amazon.com/Building-Machine-Learning-Systems-Python/dp/1782161406/ref=sr_1_1?s=books&ie=UTF8&qid=1394187002&sr=1-1&keywords=Coelho-Building+Machine+Learning+Systems+with+Python
 * http://www.amazon.com/Machine-Learning-Action-Peter-Harrington/dp/1617290181/ref=pd_sim_b_3?ie=UTF8&refRID=1DHDKNY77W2YM0W55HY9
 * ==In French==
 * http://www.amazon.com/Apprentissage-artificiel-algorithmes-Antoine-Cornu%C3%A9jols/dp/2212110200/ref=sr_1_6?ie=UTF8&qid=1394186532&sr=8-6&keywords=cornu%C3%A9jols (in French)
 * http://www.amazon.com/Data-Mining-Practical-Techniques-Management/dp/0123748569/ref=dp_ob_title_bk
 * ==Intro==
 * https://www.youtube.com/channel/UCXYXSGq6Oz21b43hpW2DCvw
 * http://www.ibm.com/developerworks/library/os-weka1/
 * http://downloads.sourceforge.net/project/weka/documentation/3.7.x/WekaManual-3-7-10.pdf
 * http://weka.wikispaces.com/Frequently+Asked+Questions
 * =General Documentation=
 * http://www.cs.waikato.ac.nz/ml/weka/documentation.html
 * http://www.cs.waikato.ac.nz/ml/weka/help.html
 * http://weka.wikispaces.com/
 * http://weka.wikispaces.com/Use+WEKA+in+your+Java+code
 * ==APIs==
 * http://weka.wikispaces.com/Use+WEKA+in+your+Java+code
 * http://weka.sourceforge.net/doc.stable/
 * http://weka.sourceforge.net/doc.dev/
 * ==Code Examples==
 * A Simple Text Classifier in Java with WEKA presents and discuses two little programs as examples of how to integrate WEKA into your Java code for text mining: http://jmgomezhidalgo.blogspot.com.es/2013/04/a-simple-text-classifier-in-java-with.html
 * Language Identification as Text Classification with WEKA explains how to build an automated language guesser for texts as a complete example of a Text Mining process with WEKA, and in order to demonstrate a more advanced usage of the StringToWordVector class: http://jmgomezhidalgo.blogspot.com.es/2013/05/language-identification-as-text.html
 * Sample Code for Text Indexing with WEKA shows how to index a text dataset using your own Java code and the StringToWordVector filter in WEKA: http://jmgomezhidalgo.blogspot.com.es/2013/06/sample-code-for-text-indexing-with-weka.html
 * http://permalink.gmane.org/gmane.comp.ai.weka/33249
 * http://permalink.gmane.org/gmane.comp.ai.weka/33302 (text classification full example)
 * [TO BE COMPLETED there are lots of examples on this same site, just find them :)]
 * ==Additions/Modification to inner code==
 * ===Modification of Weka===
 * http://weka.wikispaces.com/Subversion (get source from SVN)
 * ===Creating a package===
 * http://weka.8497.n7.nabble.com/Contributing-a-package-doubts-td30164.html
 * ==Migration==
 * Serialize from 3.6.8 (stable) / Deserialize to 3.7.10 (development) -> http://article.gmane.org/gmane.comp.ai.weka/33368
 * ==Weka Input==
 * ===Weka Configuration Files===
 * http://weka.wikispaces.com/weka_gui_explorer_Explorer.props
 * ===Weka & Excel (you should really use "flat" files like CSVs!)===
 * http://weka.sourceforge.net/packageMetaData/WekaExcel/index.html
 * ===Weka & CSV===
 * http://weka.wikispaces.com/CSV+file+conversion
 * http://weka.wikispaces.com/Converting+CSV+to+ARFF
 * ==Weka Output==
 * ===Making Predictions===
 * http://weka.wikispaces.com/Making+predictions
 * ===Saving Models===
 * http://weka.wikispaces.com/Saving+and+loading+models#Explorer
 * ===Evaluation===
 * http://blog.gmane.org/gmane.comp.ai.weka/month=20140301
 * http://weka.8497.n7.nabble.com/Evaluating-a-classifier-using-a-log-loss-function-td30182.html
 * http://list.waikato.ac.nz/pipermail/wekalist/2014-March/060079.html
 * []
 * http://weka.wikispaces.com/Remote+Experiment
 * []
 * []

=Weka & Memory= = = =Weka Platform-Specifics Problems= = = =Weka & Attribute Selection= = = =Weka & Clustering= = = =Weka & Classification=
 * http://weka.8497.n7.nabble.com/Memory-Issues-and-Weka-td30220.html
 * Mac
 * http://weka.8497.n7.nabble.com/weka-jar-does-not-run-on-mac-td30146.html
 * Windows
 * http://weka.wikispaces.com/LibSVM
 * Linux
 * http://weka.wikispaces.com/Performing+attribute+selection
 * http://list.waikato.ac.nz/pipermail/wekalist/2014-March/060075.html
 * http://micans.org/mcl/
 * Metrics for validation/comparison: there is none in Weka as of 2014/03, but some ideas can be found here:
 * http://permalink.gmane.org/gmane.comp.ai.weka/33317
 * [] (in section " Additional Functionality"). For instance:
 * []
 * []
 * []
 * ==Weka & SMO/SVM==
 * http://permalink.gmane.org/gmane.comp.ai.weka/33211

=Specific Applications and other tools=
 * ==Working with Text==
 * http://weka.wikispaces.com/Text+categorization+with+Weka
 * http://www.esp.uem.es/jmgomez/tmweka/index.html
 * http://jmgomezhidalgo.blogspot.com.es/2013/04/a-simple-text-classifier-in-java-with.html
 * http://jmgomezhidalgo.blogspot.com.es/2013/02/text-mining-in-weka-revisited-selecting.html
 * http://www.youtube.com/watch?v=IY29uC4uem8 (e.g. the text classification of the "Imdb data set")
 * http://weka.8497.n7.nabble.com/How-to-use-probabilistic-latent-semantic-analysis-PLSA-td30145.html (LSA)
 * ===Stemming===
 * http://comments.gmane.org/gmane.comp.ai.weka/33202
 * ==Search Engine / Reranking==
 * http://lemurproject.org/components.php
 * http://www.uni-marburg.de/fb12/kebi/research/software/WEKA-LR-PAGE
 * http://fantail.quansun.com/ (ranking prediction, multi-target regression, label ranking and metalearning)
 * http://svmlight.joachims.org/ (reformulation of SVM to do reranking)
 * ==Graphs==
 * http://micans.org/mcl/
 * ==BigData==
 * https://mahout.apache.org/
 * http://www.cs.waikato.ac.nz/ml/weka/bigdata.html
 * http://wiki.pentaho.com/display/DATAMINING/Handling+Large+Data+Sets+with+Weka
 * ==Named Entity Recognition (POS, NER)==
 * (2006) http://gate.ac.uk/
 * (2002) http://www.cnts.ua.ac.be/conll2002/ner/
 * ==Extraction, Transformation Loading (ETL)==
 * http://community.pentaho.com/projects/data-integration/
 * ==Other ML tools==
 * http://java-ml.sourceforge.net/ (Data manipulation, Clustering, Feature selection, Classification, Databases: Bayes, MCL, KD-tree, DTW...)
 * http://mallet.cs.umass.edu/ (document classification, sequence tagging, topic modeling, numerical optimization: LDA, GRMM, CRF...)
 * http://www.spagobi.org/ (Business Intelligence: OLAP, KPI, data visualization, geospatial analytics...)
 * http://cran.r-project.org/web/views/MachineLearning.html (all sorts of algorithms in R)