TLD Vision Tracking algorithm

Source: Internet
Author: User
Tags tld

Source: TLD Vision Tracking algorithm

TLD algorithm good one, here is a video, the author shows the effect of the algorithm, http://www.56.com/u83/v_NTk3Mzc1NTI.html. Below this CSDN blog has someone to do the related summary, feel very good, collection! Here is a compressive tracking URL, the code provided is very small, but real-time is very good, matlab code down can be used.

The following blog post is transferred from: http://blog.csdn.net/windtalkersm/article/details/8018980

TLD is the abbreviation of an algorithm, which is called tracking-learning-detection by the original author. Visual people see this name will be startled, very ambitious plan. It was a 09 job, not too long, but not too new. There are a lot of resources on the internet about this, much to do with the author's Open source code.

The first problem encountered in the learning process is that there are too many resources---of course it is relative to this field, generally can find a faithful reproduction algorithm of the source code is already very good. So to find the list, although a bit of a waste of time, I hope that can be helpful to others. Specific details are not much to say, there are a lot of great analysis is listed below, such as zouxy09 write the source code comments, can not be more detailed. If hard to pick a fault, it is a large paragraph of the text makes people dizzy, also did not how typesetting. I would like to draw a few simple pictures to add, I do not know what good point of the drawing program recommended (latex, or GNUPlot?)

Source Code Resources:

1. Original author Zdenek Kalal

Author's homepage: http://info.ee.surrey.ac.uk/Personal/Z.Kalal/

Source code page: Https://github.com/zk00006/OpenTLD

Programming language: Matlab + C

2. Alan Torres Edition

Source code page: Https://github.com/alantrrs/OpenTLD

Implementation language: C + +

3. Version Arthurv

Source code page: Https://github.com/arthurv/OpenTLD

Implementation language: C + +

Note: And none of the above found any difference

4. Version Jmfs

Source code page: Https://github.com/jmfs/OpenTLD

Implementation language: C + +

Note: There is no difference from the above two, only joined the VS2010 project file, in theory can be directly under the Windows compiler. But OpenCV can't detect the author's webcam (!!!) ), so he used another Videoinput class to handle the camera input.

This is a adaptation of arthurv ' s fork of Opentld (Https://github.com/arthurv/OpenTLD) to being immeadiately runnable in Vis UAL Studio 2010.

5. Georg Nebehay Version (finally a different one ....) )

Source code page: http://gnebehay.github.com/OpenTLD/

Note 1: The benefit of this is to provide executable file downloads (Ubuntu 10.04 and Windows). But, as you would expect, basically on your machine can not run. Just build it yourself.

Note 2: This version requires QT installation. But it seems that the author turned off the QT option (the associated code is still there), so it compiles, but the results are not displayed.

Note 3:CSDN download has a "opentld Qt version", this is the version. But adding a project file for vs---still not pnp on my machine, don ' t bother

http://download.csdn.net/download/muzi198783/4111915

6. Paul Nader Edition (another QT version!) )

qopentld:http://qopentld.sourceforge.net/

Source code page: http://sourceforge.net/projects/qopentld/

Note 1: OpenCV and QT are required. The original system requires QT 4.3.7OpenCV 2.2.

Note 2:windows and Linux are provided with either a compile project or a makefile. It is also the only tld! to be ported to the Android platform.

7. Ben Pryke Edition (another student project! )

Source code page: Https://github.com/Ninjakannon/BPTLD

Note: The matlab+c/c++ is still a hybrid implementation. The highlight is a very detailed documentation (8 pages), which introduces the understanding and implementation details of the algorithm. can help understand the original algorithm

Blog Resources (Chinese):

1. Discovering TLD (yang_xian521)

http://blog.csdn.net/yang_xian521/article/details/6952870

Note 1: From the article, the author is based on the MATLAB version of the original author analysis. From the function name to see above 2/3/4 should be matlab--->c++ "literal translation", function names unchanged. This is best, can be compared with the following view, while learning matlab and C + +

2. TLD (tracking-learning-detection) Learning and source Understanding (ZOUXY09)

http://blog.csdn.net/zouxy09/article/details/7893011

Note 1: With the <<ARTHURV version of the >>, said the previous, can not be detailed!

NOTE 2: The following three are from this copy of the

3. "Re-talk about PN learning":
http://blog.csdn.net/carson2005/article/details/7647519
4.--TLD Tracking algorithm for video tracking algorithms that are stronger than Microsoft Kinect
http://blog.csdn.net/carson2005/article/details/7647500
5. "TLD Vision tracking Technology Analysis"
Http://www.asmag.com.cn/number/n-50168.shtml

What you want to say:

1. Share: The previous period of time to Tld::init (...) After reading, Ben wanted to bang, other things too much had to put down. However, I am more familiar with detection and tracking, Init has already put the learning done again, understand the rest is easy. Now re-pick up, inadvertently found the Zouxy09 notes, save too much strength, half a day to see the finished. A lot of details do not have to take their own---we often complain that the resources of the document is too few, envy foreigners ability strong, quick action, and they are willing to share too much. Often see some good collection of articles, a few days to see incredibly deleted!

It may be understood that the company should be kept secret, but if the fear that others know their own ideas can not be done, it is still not in this area to do. The algorithm is only thought, and no one can monopolize it. The algorithm must be constantly updated, holding an algorithm will not survive a few years. The original author also started the company based on this technology, and did not see them based on this restriction others use. Sift,surf are patent and haven't heard of making a lot of money, Kinect tells you the algorithm can not be realized. The secret is to implement the details

2. Comparison: Finally read the realization, the overall feeling of the algorithm is more like engineering applications (engineering) rather than theoretical breakthroughs (also can not ask too much is not). It is not necessarily better than a single tracking (tracking) or detection (detection-by-classification) module, after all, that the two fundamental challenges of appearance (appearence) and scale change are not resolved. But this kind of framework should actually be very practical, because----------adjustable parameters too much!

TLD believes that many people have tried, real-time many people are complaining, and get their own video on the total need to adjust the parameters of the effect is good.

More like this year ECCV Kaihua Zhang's compressive Tracking: Theoretical advanced Scary (joking), the source of simple scary. And so far I have tried the off-the-shelf tracker in the best tracking effect, do not adjust any parameters, absolute real-time----code so little, want to not real-time is difficult (by the way, the author's blog mentioned above). This is the way to do research, there is a strong theory to support the implementation can be very simple without affecting the effect. So if the math people are willing to do the application, a lot of people will be laid off

Http://www4.comp.polyu.edu.hk/~cslzhang/CT/CT.htm

Another PWP (Pixel-wise posteriors), publish time and TLD, looks pretty good, but the author says it's open source and hasn't been cashed. is a pity. Individual feel that level set on partial occlusion effect should be very good, real-time is not difficult

Http://www.robots.ox.ac.uk/~cbibby/research_pwp.shtml

3. Summary: TLD is actually a very suitable entry and advanced algorithm:

A. Theory, high-quality paper (BMVC, CVPR, ICPR, last Pami)

B. Active Code! Matlab, C + +, Windows, Linux, ..... What else do you want?

C. A detailed introduction and detailed code comments that are shared by different bull calves (almost every line is explained)!

4. Involved in a wide range, involving detection, tracking, classifcation, traditional visual technology is so stiff division of the three major categories. After the study of each part of how much can be a bit of experience.

The End
downstairs comment to the URL is also very good AH: http://blog.csdn.net/zouxy09/article/details/7893022

TLD Vision Tracking Algorithm (RPM)

Contact Us

The content source of this page is from Internet, which doesn't represent Alibaba Cloud's opinion; products and services mentioned on that page don't have any relationship with Alibaba Cloud. If the content of the page makes you feel confusing, please write us an email, we will handle the problem within 5 days after receiving your email.

If you find any instances of plagiarism from the community, please send an email to: info-contact@alibabacloud.com and provide relevant evidence. A staff member will contact you within 5 working days.

A Free Trial That Lets You Build Big!

Start building with 50+ products and up to 12 months usage for Elastic Compute Service

  • Sales Support

    1 on 1 presale consultation

  • After-Sales Support

    24/7 Technical Support 6 Free Tickets per Quarter Faster Response

  • Alibaba Cloud offers highly flexible support services tailored to meet your exact needs.