Meta Information Extraction

The current issue of the IRW newsletter will arrive in subscribers’ inboxes over the weekend. In this issue, we examine how to extract hidden information through dashboard technology:

“Every HTML document contains hidden meta information (i.e., information about information) that is usable to businesses and average users. This information can be either structured or unstructured.

Structured data can be extracted from the Document Object Model (DOM) by processing its markup tags; for instance, by extracting its meta tags. In the case of unstructured data, this type of information is accessible through statistical analysis and other forms of mathematical analysis.”
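
To make the structured case concrete, here is a minimal sketch in Python of pulling meta tags out of an HTML document. It is not taken from the newsletter: the names MetaTagExtractor, extract_meta, and SAMPLE_HTML are hypothetical, and it uses only the standard library's html.parser module rather than a full DOM implementation.

```python
from html.parser import HTMLParser

# Hypothetical sample page, used only for illustration.
SAMPLE_HTML = """
<html>
  <head>
    <title>Example page</title>
    <meta name="description" content="A short summary of the page.">
    <meta name="keywords" content="meta, extraction, html">
    <meta property="og:title" content="Example page" />
    <meta charset="utf-8">
  </head>
  <body><p>Visible text.</p></body>
</html>
"""


class MetaTagExtractor(HTMLParser):
    """Collects key/content pairs from <meta> tags."""

    def __init__(self):
        super().__init__()
        self.meta = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attributes = dict(attrs)
        # Accept both name="..." and property="..." (the latter is used by Open Graph).
        key = attributes.get("name") or attributes.get("property")
        content = attributes.get("content")
        # Skip tags such as <meta charset="utf-8"> that carry no key/content pair.
        if key and content is not None:
            self.meta[key.lower()] = content


def extract_meta(html):
    """Return a dict mapping meta tag names to their content values."""
    parser = MetaTagExtractor()
    parser.feed(html)
    parser.close()
    return parser.meta


if __name__ == "__main__":
    for name, value in extract_meta(SAMPLE_HTML).items():
        print(f"{name}: {value}")
```

The sketch keeps only tags that have both a key and a content attribute, so declarations such as the charset tag are ignored. When the same key appears more than once, the last value wins; a list per key would be a reasonable alternative.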

Enjoy it.
