SIGMA - Large Scale Machine Learning Toolkit

posted Oct 23, 2011, 9:13 PM by Botao Hu   [ updated Oct 24, 2011, 10:50 PM ]
Core Developer
Mar 2011 - Jun 2011
Mentored by Weizhu Chen
Microsoft Research Asia, Beijing

The goal of this project is to provide a set of parallel machine learning functions that meet the needs of research and applications involving large-scale data and feature sets. The toolkit includes, but is not limited to, classification, clustering, ranking, and statistical analysis, and runs them in parallel across hundreds of machines and thousands of CPU cores. We also provide an SDK so that researchers and developers can implement their own algorithms and add them to the toolkit.