Some time ago, the work needs to do a statistical analysis system for video quality, each end (PC, mobile and web) to put the video quality data in an HTTP request to the server, the server to parse the data, sorted from different dimensions to do real-time and offline analysis. (PS: This kind of work should be done by the statistical department, but because of various reasons fell on my head, the specific reason is slightly not said ... )
KNN works by: There is a collection of sample data, also known as the training sample set, and each data in the sample set has a label, that is, we know each data in the sample set corresponding to the classification of the respective relationship. After entering new data without a label, each feature of the new data is compared to the feature in the sample set, and then the algorithm extracts the category label of the most similar data (nearest neighbor) in the sample set. How to understand this sentence, now we think, if there are a lot of dogs in the square, these dogs are a female dog with a group of puppies, all varieties have. Every dog knows who his mother is. But there is a dog drink not to mind the water, do not know where their mother, how to find his mother. Then we'll compare the characteristics of the dog with those of the puppies. Then take the most similar dog, then his mother is the single dog's mother ~ ~ We can imagine that a Chihuahua must be far away from Teddy.
Equivalent to using these three attributes, representing a person. Different people, three attribute values are different. Use vectors [Feature1, Feature2, Feature3] to represent an appointment object. The result of the appointment, there are three kinds of possible: not satisfied, also can, very satisfied. Use class to represent the result of the appointment. In this way, each history date record can be expressed as a vector [Feature1, Feature2, Feature3, class], where:
Feature1: Number of miles flown per day
Feature2: The time spent playing games per week
Feature3: How much ice cream you can eat every week
Class: Date Result
Recently busy preparing for the school recruit related review, so also collated a bit of last semester when the lessons learned some knowledge. I just found that there was also an article similar to the nature of the literature review, which is posted here. Subject matter is about big data, is also a hot topic, although the current contact with the big data is not very related to the project, may not be engaged in this aspect of the work.
When slack means that progress stops means being eliminated.