This package can be used to parse HTML and build a DOM structure. It can parse an HTML document from a file or a string. It return a document object that can be used to traverse the document tags and data elements using functions defined in the DOM specification, such as getElementById, getElementsByTagName, getElementsByClassName, getElementsBySelector, etc..