Natural Language Processing Scalanlp-set of machine learning and numerical computing Libraries
Breeze-numeric processing library for Scala
Chalk-natural language processing database.
Factorie-a deployable probabilistic modeling toolkit that uses the scala software library. It provides you with a concise language to create a graph of relational factors, evaluate parameters, and deduce them.
Data analysis/Data Visualization Mllib in Distributed Machine Learning Library under Apache spark-Spark
Scalding-cascading Scala Interface
Summing bird-streaming mapreduce with scalding and storm
Abstract Algebra tool of algebird-Scala
Xerial-Scala data management tool
Simmer-Unix filter that simplifies your data and performs algebraic Aggregation
Predictionio-machine learning servers for software developers and data engineers.
Bidmat-CPU and GPU accelerated matrix library that supports large-scale exploratory data analysis.
General Machine Learning Conjecture-scalable machine learning framework under scalding
Decision tree tool under brushfire-scalding.
Ganitha-Scalding-based Machine Learning Library
Adam-use Apache Avro, Apache spark, and parquet genome Processing engines, with a dedicated file format, Apache 2 software license.
Bioscala-library of bioinformatics available in Scala Language
Bidmach-machine learning CPU and GPU acceleration library.
Figaro-A Scala library for constructing probabilistic models
Scala Machine Learning