The Standardizer, originally conceived as an x-to-z score converter (hence its name), is now a full-fledged statistical tool for univariate analysis. Check it out now at http://www.minerazzi.com/tools.
This tool transforms a data set into z-scores and one/two-tail percentiles.
The tool also computes measures of central tendency and dispersion, such as means, medians, standard deviations, variances, coefficients of variation, and ranges.
Available now at http://www.minerazzi.com/tools
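As a rough illustration of the quantities listed above, here is a minimal Python sketch. It is not the tool's own code. The function names are hypothetical, the sample (n - 1) standard deviation is an assumption, and the percentiles assume a standard normal distribution. In particular, treating the "two-tail percentile" as the central area within +/- |z| of the mean is our guess at the definition.

```python
import math
import statistics

def standardize(data):
    """Return z-scores, assuming the sample standard deviation (n - 1)."""
    mean = statistics.mean(data)
    sd = statistics.stdev(data)
    return [(x - mean) / sd for x in data]

def one_tail_percentile(z):
    """Percent of a standard normal distribution lying below z."""
    return 100 * 0.5 * (1 + math.erf(z / math.sqrt(2)))

def two_tail_percentile(z):
    """Percent of a standard normal lying within +/- |z| of the mean (assumed definition)."""
    return 100 * math.erf(abs(z) / math.sqrt(2))

def summarize(data):
    """Central tendency and dispersion measures for one variable."""
    mean = statistics.mean(data)
    sd = statistics.stdev(data)
    return {
        "mean": mean,
        "median": statistics.median(data),
        "stdev": sd,
        "variance": statistics.variance(data),
        "cv_percent": 100 * sd / mean,  # coefficient of variation; undefined if mean is 0
        "range": max(data) - min(data),
    }

if __name__ == "__main__":
    sample = [2, 4, 4, 4, 5, 5, 7, 9]
    print(summarize(sample))
    for x, z in zip(sample, standardize(sample)):
        print(x, round(z, 3), round(one_tail_percentile(z), 1))
```

Switching to the population standard deviation only requires replacing `statistics.stdev` and `statistics.variance` with `statistics.pstdev` and `statistics.pvariance`.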
The Intelligence, Security, and Assurance (ISA) collection is a new miner built with Minerazzi (http://www.minerazzi.com).
Use it to find resources relevant to information intelligence, security, and assurance.
Search by software tools, companies, and government agencies, or by graduate school programs offering courses on these subjects.
Sometimes you need to deduplicate or alphabetically sort a set of items delimited in some way. Perhaps you just want to remove the items that match specific terms or strings.
Items can be email addresses, phone numbers, links, URLs, names, keywords, etc. These might be delimited by lines (\n), tabs (\t), spaces, commas, colons, semicolons, or periods.
We have developed a tool that lets you edit these types of sets precisely. Just submit a data set, select how it is delimited, and choose a few edit options.
Enjoy it :)
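For readers who want to try the idea offline, the following Python sketch shows one way such edits could be done. It is an illustration only, not the tool's implementation. The function name `edit_items` and its parameters are hypothetical, and we assume case-sensitive matching and a case-insensitive alphabetical sort.

```python
def edit_items(text, delimiter, dedupe=True, sort=False, remove_terms=()):
    """Split text on a delimiter, then optionally filter, deduplicate, and sort."""
    # Split on the chosen delimiter, trim whitespace, and drop empty items.
    items = [part.strip() for part in text.split(delimiter)]
    items = [item for item in items if item]

    # Remove every item that contains one of the unwanted terms.
    if remove_terms:
        items = [i for i in items if not any(t in i for t in remove_terms)]

    # dict.fromkeys keeps the first occurrence of each item, in order.
    if dedupe:
        items = list(dict.fromkeys(items))

    # Alphabetical sort that ignores letter case.
    if sort:
        items = sorted(items, key=str.lower)

    return delimiter.join(items)

if __name__ == "__main__":
    emails = "b@x.com, a@x.com, b@x.com, spam@y.com"
    print(edit_items(emails, ",", sort=True, remove_terms=["spam"]))
```

Filtering happens before deduplication so that unwanted items never reach the later steps. Swapping the order would give the same result, just with slightly more work.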
Our old Color Miner tool is now available at
This is a tool that generates fractalettes.
We define a fractalette as a color palette within a color palette. These types of fractal-like arrays allow you to investigate color-color, color-space, and space-space relationships.
To use it, just submit an absolute URL, complete with its http(s) scheme.
Enjoy it. :)
A short tutorial on the Levenshtein Distance is available now at
Did you know that Levenshtein Distance is at the heart of sequence analysis and text mining-based technologies? It is so simple, elegant, and relevant to many research fields.
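To make the idea concrete, here is a short Python sketch of the classic dynamic-programming computation of the Levenshtein Distance, with unit costs for insertions, deletions, and substitutions. It is a generic textbook version, not necessarily what the tutorial or the calculator uses, and the function name `levenshtein` is our own.

```python
def levenshtein(a, b):
    """Minimum number of single-character edits needed to turn a into b."""
    # previous[j] is the distance between the prefix of a seen so far and b[:j].
    previous = list(range(len(b) + 1))
    for i, char_a in enumerate(a, start=1):
        current = [i]  # turning a[:i] into an empty string takes i deletions
        for j, char_b in enumerate(b, start=1):
            cost = 0 if char_a == char_b else 1
            current.append(min(
                previous[j] + 1,         # deletion
                current[j - 1] + 1,      # insertion
                previous[j - 1] + cost,  # substitution (or match)
            ))
        previous = current
    return previous[-1]

if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))
```

Only two rows of the full distance matrix are kept at a time, so memory use grows with the length of the second string rather than with the product of both lengths.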
The Levenshtein Distance Calculator is back. This tool was removed from our old site, but is now available at
This is a visual, interactive tool that is great for sequence analysis, text mining, and teaching. A tutorial listing practical applications will soon follow.
The first of a series of companion tutorials for some of our tools is available now at http://www.minerazzi.com/tutorials/index.php
In this tutorial we present a general overview of two association measures used in data mining and information retrieval: distance and similarity. To learn the difference between the two, visit
The Library Recrawl Project (http://www.minerazzi.com/lrp) is a new miner built with Minerazzi. It allows users to recrawl the world's top libraries, their catalogs, and their information gateways. Users can search inside results and uncover vast amounts of resources. To search, users can enter a library name, keywords, or a country.
Query [ library of congress ], [ world catalog ], [ national archives ], [ national libraries ], [ public libraries ], [ state libraries ], etc… Or if you prefer, search by country.
By recursively searching inside results with our Search Inside tool, you will discover entry points to vast amounts of new resources (libraries, catalogs, etc.). Have fun.
Query [ z3950 ] to find libraries across the world that use the old Z39.50 search implementation.
Wikiminer (http://www.minerazzi.com/wikiminer) is a new miner built with Minerazzi exclusively for mining Wikileaks.
It allows users to find secret information, news leaks, and classified media from anonymous sources by mining the Wikileaks link graph. Search by keywords or location.
Query [ cablegate ] in this miner. Then locate the result whose URL is https://wikileaks.org/cablegate.html and click the Search Inside tool icon below that result. By recursively searching inside a result, you will be walking a portion of the Wikileaks link graph.