This class can be used to parse and analyze HTML documents. It used the DOM document extension to parse HTML documents. The class can also analyze the parsed documents and return the document encoding, DOCTYPE, base URL, meta tags, title, links, images, document text, words in tags h1, title, a, a title, img alt, words weights and density.