Learn about the power of X Searches (short for XOR and XNOR searches) for keyword discovery, disambiguation, clustering, information retrieval, and data mining in general.
This is a follow-up to the Beauty of XOR and XNOR searches post, describing possible applications of these search modes to Information Retrieval, Search Marketing, and Web Mining. The post is a snippet taken from http://www.minerazzi.com/help/xor-xnor.php
An IR researcher can test the performance of an LSI algorithm with a sample of documents retrieved through XOR and XNOR searches. Said sample should be rich in co-occurrence cases. Using a similar procedure, search marketers or Web intelligence specialists can identify sets of documents that emphasize keywords somehow related through different co-occurrence paths.
An interesting application consists of extracting all the unique terms (or just the high-frequency ones) from a text source and constructing an XOR query with them. We may refer to this as XORing a text source. This should help one identify a network of co-occurrence paths over a collection, and which documents might be relevant to specific combinations of terms from the original source.
The text source can be a title, description, abstract, or paragraph of a document, or even an entire document. However, XORing a large document might be computationally intensive.
A similar exercise can be done by XNORing a text source. In both cases, the resultant output can be used to identify prospective competitors; i.e., documents relevant to similar concepts or belonging to companies within the same business space.
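To make the idea of XORing or XNORing a text source more concrete, here is a minimal Python sketch. It is not Minerazzi's implementation. The function names, the ASCII-only tokenizer, and the infix "XOR"/"XNOR" query syntax are all assumptions made for illustration. For a pair of terms, it assumes the usual Boolean definitions: XOR matches a document containing exactly one of the two terms, and XNOR matches a document containing both or neither.

```python
import re
from collections import Counter


def unique_terms(text, min_freq=1):
    """Distinct lowercase terms occurring at least min_freq times, in first-seen order."""
    # Assumption: a simple ASCII tokenizer; a real system would likely do more.
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    counts = Counter(tokens)
    # dict.fromkeys drops duplicates while keeping first-seen order.
    return [t for t in dict.fromkeys(tokens) if counts[t] >= min_freq]


def build_query(terms, operator="XOR"):
    """Join terms with an infix operator keyword (hypothetical query syntax)."""
    return f" {operator} ".join(terms)


def xor_match(doc_terms, a, b):
    """True when exactly one of the two terms occurs in the document."""
    return (a in doc_terms) != (b in doc_terms)


def xnor_match(doc_terms, a, b):
    """True when both terms occur in the document, or neither does."""
    return (a in doc_terms) == (b in doc_terms)


# Usage: XOR a short title, then test two toy documents against one term pair.
title = "Latent semantic indexing and semantic search"
print(build_query(unique_terms(title)))
doc_one = set(unique_terms("semantic search engines"))
doc_two = set(unique_terms("latent semantic analysis"))
print(xor_match(doc_one, "latent", "search"), xnor_match(doc_two, "latent", "semantic"))
```

Raising min_freq keeps only the high-frequency terms, which is one way to keep the query short when the text source is a large document.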
We are currently testing the XOR and XNOR search modes as a query disambiguation strategy.
PS. Today, 1-9-2014, we added new material that discusses these search modes for disambiguation and clustering.
That’s the beta test phase we are in at Minerazzi (http://www.minerazzi.com). This time we are testing some nice tools.
Beta test phase 3 of Minerazzi (http://www.minerazzi.com) is now open. A new search interface with 16 search modes and a few new tools will be tested. We have added a new tool that allows users to generate matrices from almost any type of analytics. Immediate applications to search result pages (SERPs), such as keyword matrices, search mode count matrices, etc., are possible.
When it comes to properly expressing “powered by” in Spanish web pages, a lot of Spanish-speaking users don’t seem to agree on how to say it, as can be seen from the following links:
Not even Google and Microsoft seem to find a sound way of expressing the above:
I realized that this is an even worse problem among those Hispanics and second-generation Latinos in the U.S. who are too “americanized” (if that is a valid term).
When we have problems finding terms from different languages with equivalent meanings, the best that we can do is to stop forcing translations and start describing what we want to say. This is a kind of descriptive strategy.
For instance, according to http://arl-shipzine.com/issue-2-powered-by , “powered by” implies the notion of a service being “provided by”.
So, following the descriptive strategy, “provisto por” is a more appropriate option than “impulsado por”, “potenciado por”, “generado por”, “auspiciado por”, “producido por” and other expressions that sound a bit ridiculous for the context in question. Of course, that’s just my opinion and you don’t have to agree with me.
“Impulsado por”…really? Are you launching a rocket?
“Potenciado por”…really? Are you a battery or power supply?
“Generado por”…really? Are you an electric generator?
“Auspiciado por”…really? Are you sponsoring something?
“Producido por”…really? Are you in a production business?
I know, I know. I’m being sarcastic. Me bad.
What is important to point out is that the above alternatives are subject to misinterpretation, while “a product or service provided by”, or “un producto o servicio provisto por”, has only one meaning: a product or service provided by someone, in a b2c (business-to-consumer) or b2b (business-to-business) context.
At first glance, the above seems trivial, but it is not. You would be surprised to see the faces of those Latinos who read web content and creatives translated by SEO companies that have no knowledge of Spanish or that use automatic translators. Bad translations can ruin any marketing, press release, or link-building campaign.
The Images Crawler has arrived at Mi Islita.com. An easy way to view images from Web documents. Use it to view images from newspapers, forums, social networks, etc. Enjoy it!
Who can benefit from this tool?
- Webmasters and designers that want to check for broken images and their resolutions across browsers.
- Site owners checking if other sites are infringing on their images, pictures, banners, logos, or icons.
- Users looking for an easy way to find images from newspapers, forums, social networks, or any site.
- Marketing researchers, psychologists, and others interested in studying the power of images.
We have added the following features to our email crawler:
1. A User Tracking Session (just find the link and click on it) to view current user data.
2. Search for user email addresses in the top search engines and social networks.
Give it a try.
We plan to add the tracking session feature to all our pages. This feature is now visible to give you an idea of how it works, but it can be made invisible to users. Geo and search data can be added in a snap.
Why pay monthly fees when you can have your own tracking service, customized to your needs?
We have just launched The Binary Similarity Calculator, a new tool for computing binary-based similarity measures.
What it is
The Binary Similarity Calculator (BSC) can be used to compare binary sets: groups consisting of only two types of items or states. These are item sets that can be represented as sequences of 1s and 0s.
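As an illustration of what comparing two binary sets can look like, here is a minimal Python sketch of two well-known binary similarity measures, the Jaccard coefficient and the simple matching coefficient. This is not the BSC's code, and we are not claiming these are the exact measures the tool reports. The function names and the convention for two all-zero sequences are assumptions made for the example.

```python
def contingency(x, y):
    """Count the (1,1), (1,0), (0,1), and (0,0) position pairs of two binary sequences."""
    if len(x) != len(y):
        raise ValueError("sequences must have the same length")
    a = b = c = d = 0
    for p, q in zip(x, y):
        if p and q:
            a += 1  # both are 1
        elif p:
            b += 1  # only x is 1
        elif q:
            c += 1  # only y is 1
        else:
            d += 1  # both are 0
    return a, b, c, d


def jaccard(x, y):
    """Jaccard coefficient: shared 1s over positions where at least one sequence has a 1."""
    a, b, c, _ = contingency(x, y)
    total = a + b + c
    # Assumption: two all-zero sequences are treated as identical (similarity 1.0).
    return a / total if total else 1.0


def simple_matching(x, y):
    """Simple matching coefficient: matching positions (1,1 or 0,0) over all positions."""
    a, b, c, d = contingency(x, y)
    n = a + b + c + d
    if n == 0:
        raise ValueError("sequences must not be empty")
    return (a + d) / n


# Usage: two respondents answering the same five Yes/No questions (1 = Yes, 0 = No).
first = [1, 1, 0, 0, 1]
second = [1, 0, 0, 1, 1]
print(jaccard(first, second), simple_matching(first, second))
```

The two measures differ in how they treat shared 0s: Jaccard ignores them, while simple matching counts them as agreement. Which one is appropriate depends on whether a shared "No" or "Off" is meaningful in your data.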
Who can benefit from it
• Marketing analysts that need to examine Yes/No-type questionnaires about products and services.
• Teachers and examiners that must score Yes/No-type exams or assess plagiarism cases.
• Engineers, mathematicians, and physicists that must evaluate On/Off-type records.
• Statisticians, bioanalysts, and others involved with sequencing analysis.
• To sum up, anyone that uses binary sets.
As part of the renovation process at Mi Islita (http://www.miislita.com), we are introducing a link game called Qualified Links. The idea is simple.
An editor or owner of a page (in this case, us) ends a page with what we call a “Qualified Links” section. A topic to follow is specified. Interested users submit a link to complete a sentence. Then the editor or owner of the page decides, at their sole discretion, whether a submission is on topic. If so, the link is promoted to a list of Qualified Links on that page.
The result: the current page becomes more topic-centered, while submitters have a chance to gain more link-building real estate. Only quality links are considered, using our own criteria. Sure, there will be some pros and cons down the road, but at least it is an interesting start. Monetization can surely follow the shake.
From the Harlem Shake to the Link Shake: Here comes the Qualified Links Shake :).