MongoDB ushered in the primary data analysis function

Source: Internet
Author: User
Keywords Usher analytic function direct data scientist we

To make it easier for everyone to introduce analytics into their large data storage systems, Pentaho today announced that the latest version of its Business analytics and data integration platform has officially entered the general phase.

The Pentaho 5.1 is designed to provide a bridge between the "data and analysis two separate realms" to support all Pentaho users-from developers to data scientists to business analysts. Pentaho 5.1 provides an analytical mechanism for running the MONGODB data storage system without using code, and uses the new Data Science Toolkit as a personal helper for relevant professionals. In addition, the new version can fully support the Apache Hadoop 2.0 Yarn Architecture for resource management.

"The new capabilities of Pentaho 5.1 can support our next strategic plan, including the most difficult data analysis speed enhancements, simplification and accessibility improvements," Pentaho, executive vice president and chief product officer Christopher Dziekan. "With Release 5.1, Pentaho has been able to further implement large-scale response analysis, not only to meet the real needs of large, data-driven enterprises, but also to bring small and medium-sized businesses and emerging vendors a fair environment to compete with traditional giants, even without professional development teams, Everyone can also try their skill on the big data stage. ”

Data integration platform makes MONGODB data native analysis possible

The previous version of the Pentaho platform has allowed users to integrate it with the MongoDB, using the latter as a data source and providing reports on MONGODB data. Now the new version of Pentaho is going further, directly bringing the native analysis mechanism to the data in the MongoDB and eliminating the need to involve electronic transport layer processing or coding operations. MongoDB data sets can deliver analysis directly at source, reducing the time consuming to obtain conclusions and the requirements for user expertise.

Dziekan points out that the healthcare cost solution provider Multiplan currently has about 900,000 medical providers as its partners, with more than 40 million transactions to be processed each year. Dziekan points out that Multiplan company gets the JSON source file from its own portal and stores it in MongoDB. They are using the Pentaho Analyzer plug-in, a set of drag-and-drop operational OLAP viewing tools, based on MongoDB, designed to split data into detail and create related dashboards and reports.

"Traditional RDBMS (relational database management system) analysis mechanisms are often very complex and awkward and clumsy when dealing with semi-structured or unstructured data," said Chris Palm, chief Software Architect at Multiplan. "Pentaho 5.1 platform can satisfy this kind of market demand, allow the user to realize the data analysis work directly inside MongoDB." We have seen a more accurate analysis of the new version, and this is no longer a serious limitation of the inability to handle all the data. We can now incorporate a more complete dataset into the analysis category, allowing us to get a more comprehensive analysis of our recording systems. ”

Data scientists welcome personal assistants

Pentaho also incorporates a new Data Science Toolkit in Pentaho 5.1 to make it easier for users to perform data analysis tasks, and to help data scientists quickly build a 360-degree comprehensive customer perspective and data source mix, including social networking and MongoDB. The toolkit adds a R script executor to the Pentaho Data Integration (PDI) feature, allowing users to use R scripts as part of the PDI conversion process, greatly simplifying the burden of data preparation. The toolkit also introduces the Weka scoring tool, which allows users to use classifications, clustering, and regression models. In addition, it adds Weka predictions to help users create time series analysis and predictive environments in Weka using predictive models.

"Data scientists are equivalent to getting their own personal assistants," Dziekan said. "The Data Science Toolkit provides a wide range of tools that can be used directly and are familiar to data scientists, and we are now able to manipulate them for their own service." ”

The Pentaho 5.1 platform also incorporates complete yarn integration capabilities, making it easier for developers to leverage the Pentaho data integration capabilities to fully leverage Hadoop's powerful computing power without writing complex mapreduce code. Dziekan says the addition of the yarn support capability allows PDI operations to use Hadoop resources in an elastic fashion, expanding and shrinking in accordance with the changes in data size and processing requirements. He also pointed out that the support of the yarn Advanced resource management function can integrate a variety of workload scenarios, thus bringing the persistent data conversion and analysis mechanism that the users have long desired.

Original link: http://www.cio.com/article/2375115/business-intelligence/native-data-analysis-comes-to-mongodb.html

Related Article

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.