Class, many methods are defined, but many methods are not implemented, which requires us to inheritClass, implement some methods by yourself, such as: # Overridable -- handle start tag def handle_starttag (self, tag, attrs): pass # Overridable -- handle end tag def handle_endtag (self, tag ): pass # Overridable -- handle character reference def handle_charref (self, name): pass # Overridable -- handle entity reference def handle_entityref (self, name ): pass # Overridable -- handle data def handle_data (self, data): pass # Overridable -- handle comment def handle_comment (self, data): pass # Overridable -- handle declaration def handle_decl (self, decl): pass # Overridable -- handle processing instruction def handle_pi (self, data): pass
Below is my demo
:
Python 3.3.2 (v3.3.2: d047928ae3f6, May 16 2013, 00:03:43) [MSC v.1600 32, >>> ================================== RESTART ======== ==========================================>>><! Doctype html public>
html.parser HTML_FILE = HTML_STR = ( ( ( ( ( parser = MyHTMLParser(strict = content = ( with open(path, line content += ( HTML_FILE = HTML_STR = ( ( * 50 == main()