Is JAVA necessary for data mining engineers?

Source: Internet
Author: User
I used python to implement algorithms for data mining in my statistics department. At that time, I started the tutorial "machine learning practice", which also used python. However, it was recently discovered that the recruitment requirements for data mining engineers generally involve JAVA, and the NPC data mining center also recommends that students learn JAVA by themselves. Do data mining engineers still need JAVA to master python? I used python to implement algorithms for data mining in my statistics department. At that time, I started the tutorial "machine learning practice", which also used python. However, it was recently discovered that the recruitment requirements for data mining engineers generally involve JAVA, and the NPC data mining center also recommends that students learn JAVA by themselves. Do data mining engineers still need JAVA to master python? Reply: I use Python as the main language and Scala to write Spark ML programs. Hadoop uses Pig Latin + UDF for batch processing. C/C ++ and Matlab are used for research and development, and now they are almost forgotten. After work, I also studied the front-end with great interest. HTML, CSS, and JavaScript can also be used (later Node. js is also playing ). later, I heard that Ruby is more elegant than Python. I also read song benhong's "The Future of code" and tried Ruby. I like the multiple inheritance methods of Mix-in, therefore, I will use the Mix-in method to write multiple inheritance codes for Python later. It feels good.

I am responsible for the design of the company's Big Data Platform Architecture, RTB placement and the design and development of the result tracking statistics system of mobile sdks, and the development of large-scale user portrait systems. Now I often discuss the Hadoop architecture with other Hadoop engineers in the company and Angular with front-end programmers. js and react. js framework. Discuss with backend colleagues about Tornado, Flask, Tomcat, and Play backend frameworks. Extensive learning not only makes me a data mining engineer, but also makes me a core technical engineer of the company. I already think that I am not a data mining engineer, so I changed the title to a programmer. I don't think any skill is necessary. You can be happy.

PS: I majored in learning and controlling, and I was studying robotics. Now I am mainly studying natural language processing (I am going to do AI on the tall floor !) The necessity to be familiar with JAVA is to help you build a complete set of Hadoop-related infrastructure, understand internal work methods, and various O & M tasks that cannot be avoided. On this basis, most of the statistics and recommendations can be replaced by Python.
Similarly, Spark-based application development does not need to be familiar with Scala.
In actual work, I usually hope that the recommendation team will be familiar with the underlying implementation of Hadoop and Spark, so that the problem solving capability will be much better. Language is just a tool, tool, tool !!! I am a member of the National People's Congress, but I am a newbie, and I am looking at the back of the experts to work hard.

My mentor gave us the guidance that statistical data mining has a considerable computer base, at least proficient in one language. For those of us who are not strong in programming, it is recommended that several teachers learn python well. Compared with java, it is still difficult to get started. For those who have strong programming skills, to learn java well, of course, there is no harm. After all, hadoop is implemented in java.

I don't have any experience in data mining, and I don't know what the industry thinks. I feel that @ Ji Lu's answer means that all the major roads are connected to Rome. Of course, the pace of one person may not be suitable for another person. LZ may ask more about Niu Dao and the great gods. As a cainiao interested in data mining, I would like to encourage them! Because most of apache family projects in java are not necessary, Data Mining involves a wide range of jobs and does not need to be pursued in every aspect.

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.