The day before yesterday, I wrote about the despair of producing original content. Today I continue with the topic of identifying original content, along with a few whimsical ideas about punishing plagiarism.
One option is to seek legal means to protect original work. The legal basis is that the copyright in any work belongs to its original author, whether or not the copyright has been registered. Unauthorized reproduction by others infringes that copyright, whether or not the copy retains the original author's name and link.
I wonder whether China has a copyright registration office similar to the one in the United States, where website content can also be registered for copyright protection. As mentioned above, registration is optional, not required: even an unregistered work belongs to its original author and is protected by law.
Following this idea, could search engines adopt a copyright registration system of their own? Simply put, the author registers the content with the search engine as soon as it is finished. Since a plagiarist cannot possibly register at the same moment as the author, in theory the author always has the chance to register before any plagiarist does.
The technical mechanism could resemble the ping mechanism used by blogs: the URL of your article is pinged to the search engine's database, and the search engine immediately crawls and archives the content. For identical content, whoever pings the database first is treated as the original.
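The "first ping wins" rule can be sketched roughly as follows. This is only a minimal illustration under stated assumptions, not how any real search engine works: the class `OriginalityRegistry`, its `ping` method, and the example URLs are all hypothetical, the content is passed in directly in place of an actual crawl, and "the same content" is taken to mean identical text after whitespace and case are normalized.

```python
import hashlib
from dataclasses import dataclass


@dataclass(frozen=True)
class Registration:
    url: str            # page that pinged first for this content
    received_at: float  # time the ping arrived (seconds, any clock)


class OriginalityRegistry:
    """Hypothetical registry: the first ping for a given text is the original."""

    def __init__(self):
        self._first_seen = {}  # content fingerprint -> Registration

    @staticmethod
    def fingerprint(content: str) -> str:
        # Assumption: collapse whitespace and ignore case, so trivial
        # reformatting does not produce a different fingerprint.
        normalized = " ".join(content.split()).lower()
        return hashlib.sha256(normalized.encode("utf-8")).hexdigest()

    def ping(self, url: str, content: str, received_at: float) -> Registration:
        # Record this ping only if the content has not been seen before;
        # either way, return the registration that counts as the original.
        key = self.fingerprint(content)
        return self._first_seen.setdefault(key, Registration(url, received_at))


if __name__ == "__main__":
    registry = OriginalityRegistry()
    registry.ping("https://author.example/post", "My original article.", 100.0)
    winner = registry.ping("https://copier.example/copy", "My original article.", 105.0)
    print(winner.url)
```

In this sketch a later ping for the same text never displaces the earlier one, which is the whole point of the scheme. It also shows the weakness: a single changed word yields a different fingerprint.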
Of course, many technical problems would need to be solved. For one, it seems that only blogs have this automatic ping mechanism. For this approach to work, every CMS would need to support the feature. Static web pages would also need some way to do it, such as a server-side plug-in, which in turn requires support from the server software.
For another, pings must be handled very quickly, so search engines would need strong technical capacity to process these signals in a timely manner. We all know that it takes search engines a long time to crawl web pages. Original content is certainly a smaller set than all web pages, but the volume would still be considerable. If the search engine cannot respond in real time, errors can creep in: many websites are generated from the feeds of other websites, so once original content is published it is quickly scraped or copied onto such sites.
This approach can only handle verbatim plagiarism: when two pages are compared and the original is already recorded in the database, other pages with substantially the same content are all removed. But what about people who change the title and alter a couple of words here and there? That seems to bring us back to the problem of judging whether content has been copied.
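To make the "changed a couple of words" problem concrete, here is a minimal sketch of one common way to score near-duplicate text: comparing sets of overlapping word triples (shingles) with Jaccard similarity. This technique is not something the post proposes; it is swapped in purely as an illustration. The function names, the shingle size of 3, and the 0.5 threshold are arbitrary assumptions.

```python
def shingles(text: str, k: int = 3) -> set:
    """Return the set of overlapping k-word sequences in the text."""
    words = text.lower().split()
    if len(words) < k:
        # Very short texts: treat the whole text as a single shingle.
        return {tuple(words)} if words else set()
    return {tuple(words[i:i + k]) for i in range(len(words) - k + 1)}


def jaccard(a: str, b: str) -> float:
    """Jaccard similarity of the two texts' shingle sets, from 0.0 to 1.0."""
    sa, sb = shingles(a), shingles(b)
    if not sa and not sb:
        return 1.0  # two empty texts are trivially identical
    return len(sa & sb) / len(sa | sb)


def looks_copied(original: str, candidate: str, threshold: float = 0.5) -> bool:
    # Assumption: the threshold is an illustrative value, not a tuned one.
    return jaccard(original, candidate) >= threshold


if __name__ == "__main__":
    original = "the quick brown fox jumps over the lazy dog near the river bank"
    tweaked = "the fast brown fox jumps over the lazy dog near the river bank"
    print(round(jaccard(original, tweaked), 3), looks_copied(original, tweaked))
```

Changing one word only disturbs the few shingles that contain it, so a lightly edited copy still scores high. Heavier rewriting lowers the score, which is exactly where the judgment becomes difficult.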
Another key point is that curbing chaos calls for heavy penalties. If the search engine's punishment for plagiarism is limited to asking sites to delete the infringing pages, or to not indexing those pages, it will certainly not achieve much. But if copying and unauthorized reproduction were punished severely, so that any site engaging in such behavior had its entire site removed from the index and never included again, who would still dare to copy or reproduce without permission? Of course, if that happened, there would not be many sites left on the Internet.