Improving MUST



We have tweaked MUST (Minerazzi URL Scoring Tool) to run a bit faster.

Try it with modern socials like

or with all those Old Glory Days URLs from the Search Engines Golden Age (the 90s). Check if they are defunct, redirecting, or still active.
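As a rough illustration of the three outcomes mentioned above, here is a minimal sketch that classifies a URL from the HTTP status code a request returns. It is not how MUST works internally. The helper name `classify_url_status`, and the rule that a missing response or anything outside the 2xx/3xx ranges counts as defunct, are assumptions made for this example.

```python
from typing import Optional


def classify_url_status(status_code: Optional[int]) -> str:
    """Map an HTTP status code to 'active', 'redirecting', or 'defunct'.

    Hypothetical helper: None stands for "no response at all"
    (DNS failure, timeout, refused connection).
    """
    if status_code is None:
        return "defunct"
    if 200 <= status_code < 300:
        return "active"        # 2xx: the URL still serves content
    if 300 <= status_code < 400:
        return "redirecting"   # 3xx: the URL points somewhere else
    return "defunct"           # everything else is treated as gone (an assumption)


for code in (200, 301, 404, None):
    print(code, classify_url_status(code))
```

A real checker would also have to decide how to treat soft 404s and parked domains, which a status code alone cannot reveal.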

AAAHHH: All those old days with their dumb business models.🙂

Social Pulse Parser: New Categories Added



The Social Pulse Parser (SPP) has been expanded to include the following categories.

World News (19 rss)

Technology (25 rss)

Data Mining (14 rss)

Search Engine Marketing (9 rss)

Social Media Marketing (12 rss)

Government (6 rss)

Organizations (6 rss)

We expect to track additional categories and rss resources across more social networks.

Try the SPP now at



BM25IR (Updated Version)



An updated version of the BM25IR paper is available at

Essentially, we corrected a few typos and added one line to further explain the transformation of (a) into (b) in Figure 1 of the paper.

Other than that, its content and thesis remain the same.

For the sake of transparency, we keep the older version accessible through the following services:








Artificial Neurons, IBM, and Mind Retrieval



IBM researchers have created artificial neurons and synapses.

Very relevant to mind retrieval, in many ways.

Social Pulse Parser Upgrade



We have upgraded the Social Pulse Parser (SPP) from a mere tool to a full section of Minerazzi. Organized by categories, SPP reports the sentiment generated by news across social networks.

So far SPP reports on the following categories:

  • World News
  • Government
  • Organizations
  • Technology

Additional categories will soon be added.

The following socials are covered so far: Facebook, LinkedIn, StumbleUpon, Pinterest, and Google+. Apparently, Twitter counts can no longer be parsed as easily as before, so we removed them from the list.

We welcome any suggestions or feedback.


BM25 and Power Transformations: Introducing BM25IR



Local Term Weight Models from Power Transformations
Development of BM25IR: A Best Match Model based on Inverse Regression

In this article we show how power transformations can be used as a common framework for the derivation of local term weights. We found that under some parametric conditions, BM25 and inverse regression produce equivalent results. As a special case of inverse regression, we show that the largest increment in term weight occurs when a term is mentioned for the second time. A model based on inverse regression (BM25IR) is presented. Simulations suggest that BM25IR works fairly well for different BM25 parametric conditions and document lengths.
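To make the idea of a local term weight concrete, here is a minimal sketch of the widely used BM25 term-frequency component (without the IDF factor). It is not the BM25IR model from the paper. The function names and the default values k1 = 1.2 and b = 0.75 are assumptions chosen for illustration. The sketch prints the gain in weight contributed by each repeated mention of a term, which shrinks as the term count grows.

```python
def bm25_local_weight(tf, doc_len, avg_doc_len, k1=1.2, b=0.75):
    """Standard BM25 term-frequency component (no IDF factor)."""
    # Length normalization: longer-than-average documents are penalized.
    norm = k1 * (1.0 - b + b * doc_len / avg_doc_len)
    return tf * (k1 + 1.0) / (tf + norm)


def successive_gains(max_tf, doc_len, avg_doc_len, k1=1.2, b=0.75):
    """Weight gained by the 2nd, 3rd, ..., max_tf-th mention of a term."""
    weights = [
        bm25_local_weight(tf, doc_len, avg_doc_len, k1, b)
        for tf in range(1, max_tf + 1)
    ]
    return [later - earlier for earlier, later in zip(weights, weights[1:])]


# Illustrative document of average length (both lengths are made-up values).
gains = successive_gains(max_tf=6, doc_len=100, avg_doc_len=100)
for mention, gain in enumerate(gains, start=2):
    print(f"mention {mention}: +{gain:.4f}")
```

Under these assumptions, the gain from the second mention is the largest of the repeated-mention gains, and each later mention adds less.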





Chemistry Resources



The City College Chemistry Web Resources Page is a great chemistry resource and one that we highly recommend to students, teachers, and researchers alike.

We are happy to learn that they have added our Hydrocarbons Parser to the Computational category.

We are not trying to compete with such a large chemistry resource site, but we want to let you know that we will soon be adding several new chemistry tools to the section, some inspired by the CUNY resource page and others oriented to data retrieval.

The following is a list of chemistry tools that we have developed so far.

And here is a list of chemistry miners:

Have a nice chemistry day.🙂

Energy Converter Tool



Energy Converter is a new data conversion tool, available at

Easily convert energy units or compute oil, coal, and natural gas energy equivalents, and more, with this one-to-many (O2M) mapping tool. Just input a value and press the Enter key.
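A one-to-many mapping of this kind can be sketched by converting the input to a single base unit (joules) and then dividing by every factor in a lookup table. This is only a sketch under stated assumptions, not the tool's actual code. The table name `JOULES_PER_UNIT`, the function `convert_energy`, and the choice of units are hypothetical, and the oil and coal equivalents use the conventional definitions of 41.868 GJ and 29.3076 GJ per tonne.

```python
# Joules per unit. The set of units is an illustrative assumption.
JOULES_PER_UNIT = {
    "J": 1.0,
    "kJ": 1.0e3,
    "cal": 4.184,          # thermochemical calorie
    "kWh": 3.6e6,
    "BTU": 1055.05585262,  # International Table BTU
    "toe": 41.868e9,       # tonne of oil equivalent (conventional value)
    "tce": 29.3076e9,      # tonne of coal equivalent (conventional value)
}


def convert_energy(value, from_unit):
    """Convert one input value into every unit in the table (one-to-many)."""
    joules = value * JOULES_PER_UNIT[from_unit]
    return {unit: joules / factor for unit, factor in JOULES_PER_UNIT.items()}


for unit, amount in convert_energy(1.0, "kWh").items():
    print(f"{amount:.6g} {unit}")
```

Going through a base unit keeps the table linear in the number of units, instead of storing a factor for every pair.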

To browse a comprehensive list of data conversion and extraction tools, visit