10 most popular machine learning and data Science python libraries

Source: Internet
Author: User
Tags theano numba keras



2018 will be a year of rapid growth in AI and machine learning, experts say: Compared to Python is more grounded than Java, and naturally becomes the preferred language for machine learning












In data science, Python's grammar is the closest to mathematical grammar, making it the easiest language for professionals such as mathematicians or economists to understand and learn. This article will list the top ten most useful Python tools for machine learning and data science applications









Machine learning Tools






1, Shogun






Shogun is a machine learning toolkit that focuses on support vector machine (SVM) Learning Toolkit. Written in C + +, which was created as early as 1999, it is one of the oldest machine learning tools, providing a wide range of unified machine learning methods designed to provide transparent and accessible algorithms for machine learning, and to provide free machine learning tools for anyone interested in this field.






Shogun provides a well-documented Python interface for unified large-scale learning and provides high-performance speed. However, the disadvantage of Shogun is that its API is difficult to use. (Project address: Https://github.com/shogun-toolbox/shogun)






2, Keras






Keras is a high-level neural network API that provides a Python deep learning library. For any beginner, this is the best choice for machine learning because it provides a simpler way to express neural networks than other libraries. The Keras is written in pure Python and is based on the TensorFlow, Theano, and cntk back end.






According to the official website, Keras focuses on 4 key guidelines, namely user-friendliness, modularity, extensibility, and collaboration with Python. However, in terms of speed, Keras is relatively weak. (Project address: Https://github.com/keras-team/keras)






3, Scikit-learn






Scikit-learn is a Python machine learning project. is a simple and efficient data mining and data analysis tool. Built on NumPy, SciPy, and Matplotlib. Scikit-learn provides a consistent and easy-to-use API grid and random search. Its main advantage algorithm is simple and fast. The basic functions of scikit-learn are mainly divided into six parts: classification, regression, clustering, data dimensionality reduction, model selection and data preprocessing (Project address: Https://github.com/scikit-learn/scikit-learn)






4. Pattern






Pattern is a Web mining module that provides tools for data mining, natural language processing, machine learning, network analysis, and network analysis. It also comes with complete documentation, with more than 50 examples and over 350 unit tests. The most important thing is that it is free! (Project address: Https://github.com/clips/pattern)






5, Theano






Theano is one of the most mature python deep learning libraries, Theano is named after the wife of the Greek Pythagorean philosopher and mathematician Pythagoras, Theano's main function: tightly integrated with numpy, defining the results you want with symbolic language, The framework compiles your program to run efficiently on the GPU or CPU.






It also provides tools for defining, optimizing, and evaluating mathematical expressions, and a number of other libraries can be built on Theano to explore their data structures. Nevertheless, there are some shortcomings in the use of Theano; For example, it may take a long time to learn its API, while others think the compilation time of the Theano large model is not efficient enough (Project address: Https://github.com/Theano/Theano)









Data Science Tools






1, SciPy






SciPy (pronounced "sigh Pie") is an open source mathematical, scientific, and engineering computing package. SCIPY uses various software packages such as Numpy,ipython or pandas to provide libraries for common mathematical and scientific programming tasks. This tool is a good choice when you want to manipulate numbers on your computer and display or publish results, and it's also free. (Project address: https://github.com/scipy/scipy)






2, Dask






The dask is a flexible parallel computing library for analytical computing. Similarly, by changing only a few lines of code, you can quickly parallelize existing code because its dataframe is the same as in the Pandas library, and its array object works like NumPy can parallelize and write in pure python. (Project address: Https://github.com/dask/dask)






3, Numba






This tool is an open-source optimization compiler that uses the LLVM compiler infrastructure to compile Python syntax into machine code. The main advantage of using Numba in data science applications is that it uses the NumPy array to speed up the application's capabilities, because Numba is a compiler that supports numpy. Like Scikit-learn, Numba is also suitable for machine learning applications. (Project address: Https://github.com/numba/numba)






4, Hpat






The high-Performance Analysis Toolkit (HPAT) is a compiler-based big data framework. It automatically expands the analysis/machine learning code in Python into Big data analytics and machine learning in a clustered/cloud environment, and can use the @jit adorner to optimize specific features. (Project address: Https://github.com/IntelLabs/hpat)






5, Cython






Cython is your best choice when working with code that runs in a mathematical or password loop. Cython is a Pyrex-based source code translator that quickly generates a Python extension module (extention module) tool. The Cython language is very close to the Python language, but Cython also supports calling C functions and declaring C types on variables and class properties. This allows the compiler to generate very efficient C code from the Cython code. (Project address: Https://github.com/cython/cython)






Benefits: May 10 (Thursday) Eight o'clock in the evening: "Live Online" An introduction to the method of cross test for---of sorting and evaluating artifact





Registration method: identify the promotional map QR code, the successful landing site immediately after registration!






Follow the public account



" Pegasus "














past-term benefits concerned about the Pegasus public number, reply to the corresponding keywords package download learning materials; reply to "into the group", join the Pegasus network AI, Big data, Project Manager Learning Group, and excellent people grow together!


Reply to the number "1" download from Getting started to research, the field of AI most worth reading 10 of this information (with download)



Reply to the digital "2" machine learning & Data Science must-read classic books with a kit!



Reply to number "3" into AI & ML: From basic statistics to machine learning book list (PDF download included)



Reply to the number "4" understanding AI, 30 book list not to be missed (with electronic version pdf download)



Reply to the number "5" Big Data learning materials download, beginner's Guide, data analysis tools, software use tutorial



Reply to the number "6"ai Artificial Intelligence: 54 Industry Heavyweight report summary (download included)



Reply Number "7"tensorflow Introduction, installation tutorial, image recognition application (with installation package/guide)



Reply to the number "8" Big Data Full Analysis (352 cases + Big Data trading white Paper + domestic and foreign Policy Exchange)



Reply to the number "9" Dry Foods | Recommend reading 10 Big Data Books (Beginner/Intermediate/Advanced) become big Data Expert!



Reply to the number "ten" 160-page report: 2030 Global will probably 800 million people to be the machine to rob the job



Reply to the number "one"50 book Package: AI AI/Big Data/database/linear algebra/python/machine learning/hadoop



Reply number "a" small white | python+matlab+ machine learning + deep neural Network + theory + Practice + video + Courseware + source code, attached download!



Reply to digital " Big Data" technology tutorials + Books +hadoop video + Big Data research + Popular science books



Reply number "+" small white | Machine learning and deep learning must read books + machine learning combat video/ppt+ Big Data analysis books recommended!



Reply Digital " Big Data" Hadoop technology ebook + technical theory + actual combat + Source code Analysis + expert share PPT






Reply to the number "100G" python from beginner to proficient! Self-study the necessary full set of video Tutorials +python Classic books!



Reply to the number"" "" dry "31 articles on deep learning must-read paper summary (attached thesis)



Reply to the number "526"Industry reports + White paper: AI, Robotics, Smart travel, smart home, Internet of Things, vr/ar, blockchain, etc. (download included)



Reply to the digital "800G"ai learning materials: AI ebook +python language introduction + tutorials + machine learning and other time-limited free collection!



Reply to ai download "FMI speech ppt for AI and Big Data Summit"



Reply ai lake download "Ten AI lakes and rivers Field"



Reply to ml practice Download "Machine Learning Practical Experience Guide (English version)"



Reply to DL paper download "deep learning more than 100 articles"



Recovery algorithm download "Data Mining ten classic Algorithms"



Reply 6.10 download "6.10 Hungry & Flying Horse Network project management practice PPT"









10 most popular machine learning and data Science python libraries


Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.