The Extended Boolean Model for Information Retrieval

The Extended Boolean Model for Information Retrieval. This is an IR tutorial I wrote circa 2006 (http://www.minerazzi.com/tutorials/term-vector-6.pdf). It may be … More

Vector Space Explorer Tool

Vector Space Explorer Tool is a new tool from Minerazzi, available now at http://www.minerazzi.com/tools/vector-space-explorer/explorer.php VSE is aimed at exploring combinations … More

Hybrid Similarity Search (HSS) Algorithm for Chemistry Searching for Fentanyl-related compounds and other drugs. Free version: https://www.mswil.com/images/NIST/NIST17/GCMS-Hybrid-Search-AnalChem-2017.pdf This is a … More

Here is a python-based search engine with an implementation inspired on one of our papers at the old Mi Islita.com … More

We have expanded the number of similarity measures that our Binary Similarity Calculator computes from 30 to 72 (and counting…) … More

An updated version of the BM25IR paper is available at Click to access bm25ir.pdf Essentially we corrected few typos and … More

Local Term Weight Models from Power Transformations Development of BM25IR: A Best Match Model based on Inverse Regression In this … More

Just a reminder on how we can model good keywords: Poisson Mixtures: Poisson mixtures fit the data better than standard Poissons … More

Our 2006 legacy tutorial on the Extended Boolean Model is back, with its content edited and updated. It is available … More

This is Part 4 of a tutorial series on Term Vector Theory. An introduction to several local weight models is … More