Finding Resources with MHM

Following our previous post on our recent host mining tool, available at http://www.minerazzi.com/tools/mhm/mhm.php

If a search retrieves few results, try submitting one of these as the initial query or one of the alternate searches the tool suggests you. An example follows:

A search for “craiglist.com” returns 0 results. One of the alternate searches is “www.craiglist.org” Submitting this new query returns 260+ results.

abbotsford.en.craigslist.ca
aberdeen.craigslist.co.uk
abilene.craigslist.org
albany.craigslist.org
albuquerque.craigslist.org
allentown.craigslist.org
altoona.craigslist.org
amarillo.craigslist.org
amsterdam.craigslist.org
anchorage.craigslist.org
annarbor.craigslist.org
appleton.craigslist.org
asheville.craigslist.org
atlanta.craigslist.org
augusta.craigslist.org
austin.craigslist.org
baltimore.craigslist.org
bangkok.craigslist.co.th
bellingham.craigslist.org
bemidji.craigslist.org
bend.craigslist.org
bgky.craigslist.org
bham.craigslist.org
binghamton.craigslist.org
blacksburg.craigslist.org
boise.craigslist.org
boston.craigslist.org
boulder.craigslist.org
brownsville.en.craigslist.org
buffalo.craigslist.org
burlington.craigslist.org
butte.craigslist.org
calibex.com
capecod.craigslist.org
caracas.craigslist.org
carbondale.craigslist.org
caribbean.craigslist.org
centralmich.craigslist.org
charleston.craigslist.org
charlotte.craigslist.org
charlottesville.craigslist.org
chattanooga.craigslist.org
chicago.craigslist.org
chico.craigslist.org
cincinnati.craigslist.org
cnj.craigslist.org
cnj.en.craigslist.org
collegestation.craigslist.org
colombia.craigslist.org
columbia.craigslist.org
columbiamo.craigslist.org
columbus.craigslist.org
columbusga.craigslist.org
corpuschristi.craigslist.org
corvallis.craigslist.org
cosprings.craigslist.org
costarica.craigslist.org
costarica.en.craigslist.org
craiglist.org
craigslist-search.com
craigslist.ca
craigslist.ch
craigslist.co.uk
craigslist.com
craigslist.fi
craigslist.jp
craigslist.org
craigslist.pl
craigslistcompanion.com
craigslistproxy.net
dallas.craigslist.org
dayton.craigslist.org
daytona.craigslist.org
decatur.craigslist.org
delaware.craigslist.org
delrio.craigslist.org
denver.craigslist.org
detroit.craigslist.org
dothan.craigslist.org
dublin.craigslist.org
duluth.craigslist.org
easternshore.craigslist.org
eastidaho.craigslist.org
eastnc.craigslist.org
eastoregon.craigslist.org
easttexas.craigslist.org
eauclaire.craigslist.org
elmira.craigslist.org
erie.craigslist.org
eugene.craigslist.org
evansville.craigslist.org
fargo.craigslist.org
fayar.craigslist.org
fayetteville.craigslist.org
flagstaff.craigslist.org
flint.craigslist.org
fortcollins.craigslist.org
fortlauderdale.craigslist.org
fortmyers.craigslist.org
fortwayne.craigslist.org
fredericksburg.craigslist.org
fresno.craigslist.org
gainesville.craigslist.org
geo.craigslist.org
goldcountry.craigslist.org
grandisland.craigslist.org
grandrapids.craigslist.org
greenbay.craigslist.org
greensboro.craigslist.org
greenville.craigslist.org
hamilton.en.craigslist.ca
harrisburg.craigslist.org
harrisonburg.craigslist.org
hickory.craigslist.org
honolulu.craigslist.org
houston.craigslist.org
hudsonvalley.craigslist.org
humboldt.craigslist.org
huntsville.craigslist.org
indianapolis.craigslist.org
indianapolis.en.craigslist.org
inlandempire.craigslist.org
iowacity.craigslist.org
ithaca.craigslist.org
jacksontn.craigslist.org
jacksonville.craigslist.org
jerseyshore.craigslist.org
joplin.craigslist.org
kalamazoo.craigslist.org
kansascity.craigslist.org
kenai.craigslist.org
killeen.craigslist.org
kitchener.en.craigslist.ca
knoxville.craigslist.org
kokomo.craigslist.org
kpr.craigslist.org
lacrosse.craigslist.org
lakeland.craigslist.org
lansing.craigslist.org
lascruces.en.craigslist.org
lasvegas.craigslist.org
lawrence.craigslist.org
lexington.craigslist.org
lima.craigslist.org
lincoln.craigslist.org
logan.craigslist.org
london.craigslist.co.uk
longisland.craigslist.org
losangeles.craigslist.org
louisville.craigslist.org
macon.craigslist.org
madison.craigslist.org
manchester.craigslist.co.uk
mansfield.craigslist.org
marshall.craigslist.org
mcallen.en.craigslist.org
memphis.craigslist.org
mendocino.craigslist.org
miami.craigslist.org
milwaukee.craigslist.org
minneapolis.craigslist.org
mobile.craigslist.org
modesto.craigslist.org
montana.craigslist.org
monterey.craigslist.org
montgomery.craigslist.org
muncie.craigslist.org
nashville.craigslist.org
nd.craigslist.org
newhaven.craigslist.org
newjersey.craigslist.org
newlondon.craigslist.org
neworleans.craigslist.org
newyork.craigslist.org
nextag.com
nh.craigslist.org
norfolk.craigslist.org
northernwi.craigslist.org
northmiss.craigslist.org
nottingham.craigslist.co.uk
nwct.craigslist.org
ocala.craigslist.org
oklahomacity.craigslist.org
olympic.craigslist.org
omaha.craigslist.org
orangecounty.craigslist.org
oregoncoast.craigslist.org
orlando.craigslist.org
panamacity.craigslist.org
paris.craigslist.fr
philadelphia.craigslist.org
phoenix.craigslist.org
pittsburgh.craigslist.org
porthuron.craigslist.org
portland.craigslist.org
prescott.craigslist.org
providence.craigslist.org
provo.craigslist.org
puertorico.en.craigslist.org
pullman.craigslist.org
quadcities.craigslist.org
redding.craigslist.org
reno.craigslist.org
richmond.craigslist.org
rmn.craigslist.org
roanoke.craigslist.org
rochester.craigslist.org
rockford.craigslist.org
roseburg.craigslist.org
sacramento.craigslist.org
salem.craigslist.org
saltlakecity.craigslist.org
sandiego.craigslist.org
sandusky.craigslist.org
santafe.craigslist.org
sarasota.craigslist.org
savannah.craigslist.org
scranton.craigslist.org
sd.craigslist.org
seattle.craigslist.org
sfbay.craigslist.org
shreveport.craigslist.org
siouxfalls.craigslist.org
skagit.craigslist.org
slo.craigslist.org
smd.craigslist.org
spacecoast.craigslist.org
spokane.craigslist.org
springfield.craigslist.org
stcloud.craigslist.org
stgeorge.craigslist.org
stillwater.craigslist.org
stlouis.craigslist.org
stpetersburg.craigslist.org
swva.craigslist.org
syracuse.craigslist.org
tampa.craigslist.org
telaviv.craigslist.org
terrahaute.craigslist.org
tippecanoe.craigslist.org
toledo.craigslist.org
topeka.craigslist.org
toronto.en.craigslist.ca
tricities.craigslist.org
tucson.craigslist.org
tulsa.craigslist.org
valdosta.craigslist.org
vancouver.craigslist.ca
ventura.craigslist.org
vietnam.craigslist.org
visalia.craigslist.org
washingtondc.craigslist.org
watertown.craigslist.org
wenatchee.craigslist.org
westernmass.craigslist.org
westky.craigslist.org
westslope.craigslist.org
wichita.craigslist.org
williamsport.craigslist.org
wilmington.craigslist.org
winchester.craigslist.org
wyoming.craigslist.org
yakima.craigslist.org
york.craigslist.org
yuma.craigslist.org

 

Praise MHM.

Improving MHM, our hosts mining tool

We have improved our Minerazzi Hosts Miner (MHM) available at http://www.minerazzi.com/tools/mhm/mhm.php

The tool now provides alternate searches. We found that the discovered alternate searches some times retrieve additional resources.

Among other useful applications the tool simplifies the building of topic-specific collections and micro-indexes.

For instance, querying microsoft.com retrieves 6 results:

answers.microsoft.com
microsoft.com
microsoftproductionstudios.biz
microsoftproductionstudios.org
msdn.microsoft.com
research.microsoft.com

MHM then suggests several alternate searches. One of these is

00001001.ch

Querying this new address (which at the time of writing resolves to 65.55.58.201), retrieves 74 new results:

00001001.ch
adjuncate.com
alladinbottle.com
alterwind.microsoft.ie
amazon.com
answers.microsoft.com
buildmypinnedsite.com
descubradynamics.com.br
engkoo.jp
eufacooquegosto.com.br
gin-green.com
ieak.microsoft.com
microsoft.be
microsoft.by
microsoft.cl
microsoft.co.il
microsoft.co.kr
microsoft.co.nz
microsoft.co.uk
microsoft.co.za
microsoft.com
microsoft.com.al
microsoft.com.ar
microsoft.com.bd
microsoft.com.br
microsoft.com.mx
microsoft.com.my
microsoft.com.ph
microsoft.com.pl
microsoft.com.sa
microsoft.com.sg
microsoft.com.tw
microsoft.com.uy
microsoft.com.vn
microsoft.de
microsoft.es
microsoft.fi
microsoft.ge
microsoft.hu
microsoft.ie
microsoft.in
microsoft.it
microsoft.net
microsoft.pl
microsoft.ro
microsoft.rs
microsoft.ru
microsoft.si
microsoft.vn
microsoftdynamicsmarketplace.net
microsoftlearning.com
microsoftproductionstudios.com
microsoftproductionstudios.net
microsoftstoragepartners.com
microsrft.com
msdn.com
msdn.microsoft.com
mseventseurope.com
nokiamicrosoft.com
norouteto.com
officeonline.com.br
payeasysystem.com
peyron.ru
research.microsoft.com
rlalighting.com
support.microsoft.sk
sybari.com
sysinternals.com
technet.com
tellme.com
windows.com
windowsserver2008.com.br
windowsvista.com.br
xbox360.com.br

 

MHM: An Interesting Host Mining Tool

MHM is a tool for discovering sites on same host or IP and for the discovery of sites affiliated to each other, or that might be your competitor. It is available at

http://www.minerazzi.com/tools/mhm/mhm.php

It is great for discovering domain names branded with keywords or known name brands. Excellent also for discovering spam communities, domainers, and more.

You may also use it to build micro-indexes and topic-specific collections (as we do) or to chase down communities of personal interest to law and order agencies, recruiters, etc.

A Domain Intelligence Tool

A new domain intelligence tool is available now. http://www.minerazzi.com/tools/mdm/mdm.php
This tool checks if a brand, product, service, subdomain, initials, or keywords have been registered as a domain name. It helps you to secure the Web presence of your intellectual property while helping you to identify cybersquatters, domain brokers, and domainers.
Update: Minor glitches fixed today. Have fun :)

IP Intelligence on Reddit Banned Domains

One useful application of the Minerazzi’s URL Scoring Tool we just launched consists in doing some IP intelligence on a list of banned domains. Usually, those with a similar or common IP are in a shared hosting environment or have a common ownership, or both. This can be another piece of information that could help you identify those behind a set of domain names.

To do this, just google [banned domains] and follow a result that points to a list of domain names banned by a web property. Then paste the list in the MUST textarea and submit it. You may want to be sure the URLs are carriage return delimited (crd). In general, you could do the same analysis for lists of parked domains, hacker sites, registrar companies, affiliate program urls, etc.

Here is the result of checking this old list of Banned Domains by Reddit. To properly interpret the results, visit  our tool’s page.

 

Status Extracted URL IP Tested URL
http://xeducation.info xeducation.info
http://funny-on-youtube.com funny-on-youtube.com
200 http://echomon.co.uk 108.160.150.154 echomon.co.uk
200 http://www.imagetwist.com 162.159.240.244 http://www.imagetwist.com
200 http://www.imageporter.com 162.159.243.13 http://www.imageporter.com
200 http://blogs.discovermagazine.com 173.226.48.205 blogs.discovermagazine.com
200 http://news.discovery.com 206.190.79.225 news.discovery.com
200 http://www.sciencedaily.com 23.21.113.171 http://www.sciencedaily.com
200 http://imgflash.com/ 66.175.214.67 http://www.imgflash.com
200 http://www.globalpost.com 68.177.32.26 http://www.globalpost.com
200 http://www.businessweek.com 68.177.32.75 http://www.businessweek.com
405 http://bit.ly/ 69.58.188.39 bit.ly/
200 http://medicalxpress.com/ 69.9.167.166 http://www.medicalxpress.com
200 http://phys.org/ 69.9.167.167 http://www.phys.org
200 http://www.theatlantic.com 72.21.91.54 http://www.theatlantic.com
200 http://www.theatlanticcities.com 93.184.215.223 http://www.theatlanticcities.com
200 http://www.thewire.com/ 93.184.215.223 http://www.theatlanticwire.com