Tags
Updating Several Miners
09 Friday Jun 2017
Posted Data Mining, Human-Computer Interaction, Programming, Quantum Computing, Spam
in09 Friday Jun 2017
Posted Data Mining, Human-Computer Interaction, Programming, Quantum Computing, Spam
inTags
12 Monday Sep 2016
Tags
Algorithms, bioinformatics, chemical mining, chemistry, Data Conversion, data miners, Data Mining, information retrieval, ir, minerazzi, miners, mining, news, social mining, statistics, tools, tutorials, Vector Space Models
We have expanded the number of similarity measures that our Binary Similarity Calculator computes from 30 to 72 (and counting…)
Same measures with different names have been consolidated into a single record, and different measures with same name have been enumerated as necessary.
These similarity coefficients have many applications across disciplines: from bioinformatics to chemistry, chemometrics, statistics, data mining, information retrieval, marketing research, etc.
The tool is available at
http://www.minerazzi.com/tools/similarity/binary-similarity-calculator.php
We have also included the new similarity measures proposed by Consonni & Todeschini (2012), and Todeschini, et al (2012).
Our Tutorial on Distance and Similarity was also updated, accordingly. Check it out at
References
Consonni, V. and Todeschini, R. (2012). New Similarity Coefficients for Binary Data. MATCH Commun. Math. Comput. Chem. 68, 581-592.
Todeschini, R., Consonni, V., Xiang, H., Holliday, J., Buscema, M., and Willet, P. (2012). Similarity Coefficients for Binary Chemoinformatics Data: Overview and Extended Comparison Using Simulated and Real Data Sets. J. Chem. Inf. Model. 52 (11).
28 Thursday Jul 2016
Tags
Local Term Weight Models from Power Transformations
Development of BM25IR: A Best Match Model based on Inverse Regression
In this article we show how power transformations can be used as a common framework for the derivation of local term weights. We found that under some parametric conditions, BM25 and inverse regression produce equivalent results. As a special case of inverse regression, we show that the largest increment in term weight occurs when a term is mentioned for the second time. A model based on inverse regression (BM25IR) is presented. Simulations suggest that BM25IR works fairly well for different BM25 parametric conditions and document lengths.
Source: http://www.minerazzi.com/tutorials/bm25ir.pdf
14 Thursday Jul 2016
Posted calculators, Data Conversion, Data Mining, News, One-to-Many (O2M), Programming, Scripts, Software
inEnergy Converter is a new data conversion tool, available at
http://www.minerazzi.com/tools/energy/converter.php
Easily convert energy units or compute energy oil, coal, & natural gas equivalents and more with this one-to-many (O2M) mapping tool. Just input a value and press the Enter key.
To browse a comprehensive list of data conversion and extraction tools, visit
11 Monday Jul 2016
Tags
banks, cardless atm, data miners, Data Mining, digital wallets, miner, minerazzi, miners, mining, retail banking, tools
Retail Banking is a new Minerazzi miner, available at
http://www.minerazzi.com/banking/
Find products, services, and companies relevant to card or cardless ATM software, digital wallets, mobile payments, payment service providers, and more with this new miner. Search by technologies or keywords.
Recrawl a search result to find additional resources or build your own curated collection.
For additional topic-specific miners, visit http://www.minerazzi.com
13 Monday Jun 2016
Posted calculators, Data Conversion, Data Mining, Miscellaneous, News, Programming, Software
inThis is a new data conversion tool, available now at
http://www.minerazzi.com/tools/mass/converter.php
No need to mess over and over with annoying pull-down menus.
Just input a value and press Enter key. The tool then easily converts all kind of mass units at once, allowing you to save time and efforts.
Supports SI, Avoirdupois, Troy, Apothecaries units, and more.
The tool uses the same design pattern algorithm that powers our Length Converter tool at
http://www.minerazzi.com/tools/length/converter.php
24 Tuesday May 2016
This tool is available at
http://www.minerazzi.com/tools/fcrdns/lookups.php
The tool allows you to do Forward and Reverse DNS lookups. Given a host name, the tool finds its IP. Conversely, given an IP the tool finds the corresponding host.
Forward DNS lookup resolves a host name to an IP address (A record). The process of reverse resolving an IP address uses the pointer DNS record type (PTR record).
Thus, the tool does Forward-confirmed reverse DNS (FCrDNS) lookups. This is a networking parameter configuration where a given IP address has both forward (name-to-address) and reverse (address-to-name) Domain Name System (DNS) entries that match each other.
Unlike similar tools which do Forward/Reverse DNS lookups on a single host, our tool does lookups on multiple hosts, saving users time and effort.
To use the tool, enter one host name (or IP) per line, ending each line by pressing the Enter key.
Forward DNS lookups are faster than Reverse DNS lookups so for the latter you may want to do a few checks at once.
Depending on DNS server configurations, lookups with or without the www alias can produce dissimilar results. For instance yahoo.com with and without www returns different results.
Applications
Our tool can be used to identify Internet service providers (ISPs) who do not provide properly matching DNS and rDNS records. It can also be used to find shared hosting and, when misconfigured, forwarders information leaks.
FCrDNS verification can also be used for whitelisting purposes because spammers and phishers cannot usually by-pass this verification when they use zombie computers for email spoofing. That is, the reverse DNS might verify, but it will usually be part of another domain than the claimed domain name.
20 Friday May 2016
Tags
We have developed a new tool that simplifies Z-to-P and P-to-Z Transformations. It is available at
http://minerazzi.com/tools/phi/transformations.php
Unlike similar tools that handle one input score at a time, our tool computes Z-to-P and P-to-Z transformations over an entire set of input scores, saving users time and effort.
The tool facilitates the work of data miners, statisticians, or anyone that need to compute Z and P scores without having to consult Z statistical tables.
It is a great tool for students and teachers interested in Statistics.
02 Monday May 2016
Tags
crowdsourcing, data miners, Data Mining, freelancing, information retrieval, minerazzi, miners, news, text mining, tools
For crowdsourcers and freelancers:
This is a new minerazzi.com miner, available at
http://www.minerazzi.com/crowd
Find work-for-hire jobs and remote employment opportunities. Search by crowdsourcing and freelancing companies, projects, or expertise area.Be hired!
28 Thursday Apr 2016
Tags
Algorithms, data miners, Data Mining, information retrieval, ir, Mind Retrieval, minerazzi, tools
We are getting closer to Mind Retrieval. The implications of being able to mine the brain are obvious for all sciences, in addition to homeland security, law and order, marketing research, etc.
I got last night this news, “Scientists map brain’s ‘thesaurus’ to help decode inner thoughts
Scientists at the University of California, Berkeley, have taken a step in that direction by building a “semantic atlas” that shows in vivid colors and multiple dimensions how the human brain organizes language. The atlas identifies brain areas that respond to words that have similar meanings |
Read the news here: http://www.nsf.gov/news/news_summ.jsp?cntn_id=138437&WT.mc_id=USNSF_51&WT.mc_ev=click
Last year I mentioned that we are getting close to Mind Retrieval.
https://irthoughts.wordpress.com/2015/06/17/say-hello-to-mind-retrieval/
That post was a reminder of a previous 2010 interview by Nuno Valenzuela, a visionary SEM from Spain. Great guy.
I met Nuno back in 2007 when I was invited to present at a Madrid Search Engine Congress (OJOBuscador) on Latent Semantic Indexing (LSI).
See conference legacy links here
http://web.archive.org/web/20071213063317/http://congreso.ojobuscador.com/2007-madrid/ponentes/
http://web.archive.org/web/20071013183824/http://congreso.ojobuscador.com/2007-madrid/multimedia/
Here is a link to Nuno’s interview. You may want to resize browser window:
https://ithinksearch.wordpress.com/2010/08/10/ideas-entrevista-a-edel-garcia/
And some relevant links here:
https://irthoughts.wordpress.com/2009/06/22/ir-videos-in-spanish/
https://irthoughts.wordpress.com/2008/04/07/demystifying-lsi-video/
Unfortunately, OJOBuscador site is now defunct so their links are broken.