• About IR Thoughts

IR Thoughts

~ Thoughts on Information Retrieval, Data Mining, and Search Engines

IR Thoughts

Category Archives: Conferences

NY SES – 2012: My little briefing

26 Monday Mar 2012

Posted by egarcia in Conferences, Marketing Research, Statistics and Mathematics

≈ Leave a Comment

I’m so happy to attend the NY SES Conference and Expo. Thanks, Mike and Incisive Media for inviting me as a guest and making my presentation possible. I had the opportunity of making new friends and business contacts. Below are some talking points.

My Impressions About this SES

Mike Grehan and Incisive Media put a lot of good work and effort to make the event a success. I have the opportunity to chat with the real movers-and-shakers of the industry, again (Mike Grehan, Bruce Clay, etc…). It is good to see again all these old friends.

SEO/SEM is moving slowly but steadily toward integrated marketing research, with statistics and few flavors of real Science, as it should be. They are still a long shot to that. The driving forces toward that are many: web analytics, metrics, and of course, Google and Bing.

I remember the days during my doctoral work (1988-1995) wherein back then we talked all the time about expert systems and IR systems. We were discussing cluster analysis, dimensionality reduction algorithms, sequential simplex optimization, chemometrics, fractals, chaos, markov chains, diffusion-limited aggregation, etc.

It was not until I got my PhD at ASU (1995) where I came across few articles talking about search engines. To my surprise, we were talking about the very same thing, only that using different nomenclature. Those days are gone now!

We are getting old and smarter cookies.

Well back to my SES summary. I hope the next few lines do not hurt any candy heart.

Room for Improvement

In general, speaker presentations were fine, with few exceptions. Here I’m providing some suggestions:

Power Point Slider

My slider did not work as expected. I thought it was just me. Then later on, when attending other sessions some speakers have problems sliding their presentations. I believe this technical problem will be resolved in future SES.

Presentations

Definitively, this area needs improvements. Some speakers insisted in embedding multiple sentences within slides. This is contra-productive as forces the audience to read the text while the speaker is talking. This is a NO-NO. Presentations are not reports. One should use figures, graphs, tables, diagrams in presentations to make a point, not lengthy sentences.

Adding insult to injury, some speakers used font sizes hard to read pass the third row of the audience. I assume they tried to comply with the SES template, which suggested a Calibri font size of 16 for the body.

To prevent this from happening, in the future a larger font size for the body, like between 24 to 28, should be used for SES templates. Personally, I prefer large font sizes. Using a large size, not only should allow the audience to discern any text, but will also prevent speakers from trying to stuff too many long sentences in a single slide. It will also help the audience’s attention to go where it should be: on the speaker and not on reading long sentences with tiny text.

To my surprise, I found some slides with text in white over soft-colored background and others with dark color over dark-colored backgrounds. Exactly. Many in the audience were not able to read them. Surprisingly, some veteran speakers were making these rookie mistakes. And even others didn’t worry about fixing many obvious typos. This was kind of disappointing.

Statistical Graphs

I found several graphs presenting percentages, without specifying the reference base. Other simply stated that the base was “various sources”. This is a NO_NO and should be avoided altogether.

Digital Media Coverage of the Self-Weighting Model (SWM)

I’m planning to put online some demos, tutorials, articles, etc about how SWM can be used. In the meantime, enjoy the following links.

SES Magazine, New York – March 19-23, 2012

http://sesconference.com/_pdfs/magazines/2012/sesmagazine-newyork2012.html

Full-page version: http://issuu.com/sesmagazine/docs/sesmag_ny12/1?mode=a_p

SES New York Kick-Off: Search Marketing Errors Uncovered, by Clarissa Sajbl

http://www.linkdex.com/blog/ses-new-york-kick-off-search-marketing-errors-uncovered/

Bruce Clay’s SES – Day 1 Report, by Virginia Nussey

http://www.bruceclay.com/blog/2012/03/business-optimization-digital-age-avinash-kaushik/

Sycara’s blog March 15, 2012 by Steve Sasman

http://www.sycara.com/blog/sycaras-ses-ny-preview-big-announcements-in-the-big-apple/

SES NY – See You All There!

10 Saturday Mar 2012

Posted by egarcia in Conferences, Marketing Research, Statistics and Mathematics

≈ Leave a Comment

SES NY

Well, it is now official. I’ll be at the New York City Search Engine Strategies conference.

Here is your chance for learning about The Self-Weighting Model. This is a new model for conducting statistical Meta-Analysis (not to be mistaken for meta tags analysis).

On Day 1:

I’ll be introducing the self-weighting model of meta-analysis to the search marketing community.

Later that day I’ll be at Meets the Experts Forum, Table 19, Web Analytics subject.

On Day 3:

I’ll be in the QA portion of the Metrics for SEO

http://sesconference.com/newyork/agenda-day3.php#metrics-seo

IFLA World Congress

20 Monday Jun 2011

Posted by egarcia in Conferences

≈ Leave a Comment

The International Federation of Library Associations and Institutions (IFLA) is the leading international body representing the interests of library and information services and their users.

They are having their world congress here in Centro de Convenciones de Puerto Rico from 13-18 of August. See http://www.ifla.org/

See also conference program at http://www.ifla.org/en/news/ifla-wlic-2011-final-programme

We will see you there.

TREC Call For Participation

21 Monday Dec 2009

Posted by egarcia in Conferences

≈ Leave a Comment

Dr. Ellen Voorhees from TREC at NIST.gov sent me their most recent Call For Participation. To help with its dissemination, I am posting it in its entirety, so it will reach out through  IRThoughts and Mi Islita.com  wider audience.

              CALL FOR PARTICIPATION

                    TEXT RETRIEVAL CONFERENCE (TREC)

                      February 2010 – November 2010

                          Conducted by:
      National Institute of Standards and Technology (NIST)

The Text Retrieval Conference (TREC) workshop series encourages
research in information retrieval and related applications by
providing a large test collection, uniform scoring procedures,
and a forum for organizations interested in comparing their
results.  Now in its nineteenth year, the conference has become
the major experimental effort in the field.  Participants in
the previous TREC conferences have examined a wide variety
of retrieval techniques and retrieval environments,
including cross-language retrieval, retrieval of web documents,
multimedia retrieval, and question answering.  Details about TREC
can be found at the TREC web site,  http://trec.nist.gov.

You are invited to participate in TREC 2010.  TREC 2010 will
consist of a set of tasks known as “tracks”.  Each track focuses
on a particular subproblem or variant of the retrieval task as
described below.  Organizations may choose to participate in any or
all of the tracks.  Training and test materials are available from
NIST for some tracks; other tracks will use special collections that
are available from other organizations for a fee.

Dissemination of TREC work and results other than in the (publicly
available) conference proceedings is welcomed, but the conditions of
participation specifically preclude any advertising claims based
on TREC results.  All retrieval results submitted to NIST are
published in the Proceedings and are archived on the TREC web site.
The workshop in November is open only to participating groups that
submit retrieval results for at least one track and to selected
government personnel from sponsoring agencies.

Schedule:
——–

  By February 18 — submit your application to participate in
        TREC 2010 as described below.  Submitting an application
        will add you to the active participants’ mailing list.
        On Feb 24, NIST will announce a new password for the “active
        participants” portion of the TREC web site.  Included
        in this portion of the web site is information regarding
        the permission forms needed to obtain the TREC document
        disks.

   Beginning March 2 — document disks used in some existing
        TREC collections distributed to participants who have
        returned the required forms.  Please note that no disks
        will be shipped before March 2.

   July–August  — results submission deadline for most tracks
        Specific deadlines for each track will be included in
        the track guidelines, which will be finalized in the spring.

   September 9  (estimated) — speaker proposals due at NIST.

   September 30 (estimated) — relevance judgments and individual
        evaluation scores due back to participants.

   Nov 16-19 — TREC 2010 conference at NIST in Gaithersburg, Md. USA

Task Description:
—————-

Below is a brief summary of the tasks.  Complete descriptions of
tasks performed in previous years are included in the Overview
papers in each of the TREC proceedings (in the Publications section
of the web site).

The exact definition of the tasks to be performed in each track for
TREC 2010 is still being formulated.  Track discussion takes place
on the track mailing list.  To be added to a track mailing list,
follow the instructions for contacting that mailing list as
given below.  For questions about the track, send mail to the
track coordinator (or post the question to the track mailing list
once you join).

TREC 2010 will contain seven tracks.  The blog, chemical IR,
entity, legal, relevance feedback, and web tracks will
continue from TREC 2009.  The million query track will be
incorporated into the web track.  TREC 2010 will also contain
a new “session” track.

Blog Track — The purpose of the blog track is to explore information
    seeking behavior in the blogosphere.

    Track coordinators: Craig Macdonald, Iadh Ounis, Ian Soboroff
                        trecblog-organisers@dcs.gla.ac.uk
    Mailing list:  send a mail message to listproc@nist.gov
        such that the body consists of the line
        subscribe trec-blog <FirstName> <LastName>

Chemical IR Track — The goal of the chemical IR track is to develop
    and evaluate technology for large scale search in chemical
    documents including academic papers and patents to better
    meet the needs of professional searchers: specifically patent
    searchers and chemists.

    Track co-ordinators: John Tait, john.tait@ir-facility.org
                         Jimmy Huang, jhuang@yorku.ca
                         Jianhan Zhu, j.zhu@adastral.ucl.ac.uk
                         Mhai Lupu, m.lupu@ir-facility.org

    Track Web Page: http://www.ir-facility.org/the_irf/trec_chem.htm
    Mailing List: follow the link on the web page to join the list

Entity Track — The overall aim of this track is to perform
    entity-related search on Web data.  These search tasks
    (such as finding entities and properties of entities) address
    common information needs that are not that well modeled as
    ad hoc document search.

    Track coordinators: Krisztian Balog, k.balog@uva.nl
                        Paul Thomas, Paul.Thomas@csiro.au
                        Arjen P. de Vries, arjen@acm.org
                        Thijs Westerveld, thijs.westerveld@teezir.nl
    Track web page: http://ilps.science.uva.nl/trec-entity/
    Mailing list: visit http://groups.google.com/group/trec-entity
        to apply for membership.

Legal Track — The goal of the legal track is to develop search technology
    that meets the needs of lawyers to engage in effective discovery
    in digital document collections.

    Track coordinators: Gord Cormack, gvcormac@uwaterloo.ca
                        Maura Grossman, MRGrossman@wlrk.com
                        Bruce Hedin, bhedin@h5.com
                        Doug Oard, oard@umd.edu
    Track web page: http://trec-legal.umiacs.umd.edu
    Mailing list: contact oard@umd.edu to be added to the list.

Relevance Feedback Track — The goal of the relevance feedback track
   is to provide a framework for exploring the effects of different
   factors on the success of relevance feedback.

   Track coordinators: Chris Buckley, cabuckley@sabir.com
                       Matt Lease, ml@ischool.utexas.edu
                       Mark Smucker, msmucker@engmail.uwaterloo.ca
   Track web page:  http://groups.google.com/group/trec-relfeed
   Mailing list: follow the instructions given on the track web page
                 to join the email list

Session Track –  The Session track has two primary goals: (1) to test
        whether systems can improve their performance for a given query
        by using a previous query (and search results from the search
        session), and (2) to evaluate system performance over an entire
        query session instead of a single query.

    Track coordinators: Ben Carterette, carteret@cis.udel.edu
                        Paul Clough, p.d.clough@sheffield.ac.uk
                        Evangelos Kanoulas, ekanou@ccs.neu.edu
                        Mark Sanderson, m.sanderson@sheffield.ac.uk

    Track web page: http://ir.cis.udel.edu/sessions
    Mailing list: Use the link given on the track web page to
                  join the email list

Web Track –  The Web track explores Web-specific retrieval tasks,
    including diversity and efficiency tasks, over collections of
    up to 1 billion Web pages.
    Track coordinators: Nick Craswell, nickcr@microsoft.com
                        Charles Clarke, claclark@plg.uwaterloo.ca
    Mailing list:  send a mail message to listproc@nist.gov
        such that the body consists of the line
        subscribe trec-web <FirstName> <LastName>

Conference Format
—————–

The conference itself will be used as a forum both for presentation
of results (including failure analyses and system comparisons),
and for more lengthy system presentations describing retrieval
techniques used, experiments run using the data, and other issues
of interest to researchers in information retrieval.  As there
is a limited amount of time for these presentations, the TREC
program committee will determine which groups are asked to speak
and which groups will present in a poster session.  Groups that
are interested in having a speaking slot during the workshop
should submit a 200-300 word abstract in September describing
the experiments they performed.  The program committee will use
these abstracts to select speakers.

Data
—-
Many of the existing TREC English collections (documents, topics,
and relevance judgments) are available for training purposes and
may also be used in some of the tracks.  Parts of the training
collection (Disks 1-3) were assembled from Linguistic Data
Consortium (LDC) text, and a signed User Agreement will be required
from all participants.  The documents are an assorted collection
of newspapers, newswire, journals, and technical abstracts.
A second agreement is needed for disks (4-5).

All documents are typical of those seen in a real-world situation
(i.e. there will not be arcane vocabulary, but there may be
missing pieces of text or typographical errors).  For most tracks,
the relevance judgments against which each system’s output will be
scored will be made by experienced relevance assessors based on the
output of all TREC participants using a pooled relevance methodology.
See the Overview paper in the TREC-8 proceedings (on the TREC
web site) for a detailed discussion of pooling.
  
  
Application details:
——————–
  
Organizations wishing to participate in TREC 2010 should respond
to this call for participation by submitting an application.
Participants in previous TRECs who wish to participate
in TREC 2010 must submit a new application.

To apply, follow the instructions at
     http://ir.nist.gov/trecsubmit.open/application.html
to submit an online application.  The application system
will send an acknowledgement to the email address
supplied in the form once it has processed the form.

Any questions about conference participation should be sent
to the general TREC email address, trec@nist.gov .

FIRE: Forum for Information Retrieval Evaluation

14 Wednesday Oct 2009

Posted by egarcia in Conferences

≈ Leave a Comment

Ellen Voorhees, Director of TREC at NIST.Gov sent me this Call for Participation, reproduced below to facilitate its dissemination:

CALL FOR PARTICIPATION

FIRE
(Forum for Information Retrieval Evaluation)
Workshop
DAIICT, Gandhinagar, India
19-21 February 2010

http://www.isical.ac.in/~fire

The success of TREC, CLEF, and NTCIR has clearly established the importance
of building reusable, large-scale standard test collections in Information
Access research. The aim of FIRE is to encourage research in Indian language
Information Access by creating a similar platform for Indian languages that
provides the data and a common forum for comparing models and techniques.

The Tasks:
==========
1) Ad-hoc monolingual document retrieval in Bengali, Hindi and Marathi.

2) Ad-hoc cross-lingual document retrieval
- documents in Bengali, Hindi, Marathi, and English,
- queries in Bengali, Hindi, Marathi, Tamil, Telugu and English.
- Bengali and Hindi topics will also be transliterated and made available
in Roman script. Adhoc monolingual task participants are encouraged to
submit runs using these queries as well.

3) Retrieval and classification from mailing lists and forums.
This is a pilot task being offered by IBM India Research Lab.

4)  Ad-hoc Wikipedia-entity retrieval from news documents
- Entities mined from English Wikipedia
- Query documents from English news website
This is a pilot task being offered by Yahoo! Labs, Bangalore.

Important Dates:
================
Ad-hoc monolingual and cross-lingual document retrieval:
Training data release   Aug 15 ’09
Test data release       Nov 01 ’09
Adhoc run submission    Nov 25 ’09
Results released        Feb 01 ’10

Retrieval and classification from mailing lists and forums:
Training data release   Oct 16 ’09
Test data release       Nov 01 ’09
Run submission          Nov 25 ’09
Results declared        Feb 01 ’10

Ad-hoc Wikipedia-entity retrieval from news documents:
Training data release   Oct 15 ’09
Test data release       Nov 01 ’09
Run submission          Nov 25 ’09
Results declared        Feb 01 ’10

Task Co-ordinators:
===================
Ad-hoc retrieval:
Pushpak Bhattacharyya (pb@cse.iitb.ac.in)
IIT Bombay
Dipasree Pal (dipasree_t@isical.ac.in)
ISI Kolkata

Retrieval and classification from mailing lists and forums:
Debapriyo Majumdar (debapriyo@in.ibm.com)
IBM India Research Lab
Ayan Bandyopadhyay (ayan_t@isical.ac.in)
ISI Kolkata

Ad-hoc Wikipedia-entity retrieval from news documents:
Ashwin Tengli (ashwint@yahoo-inc.com)
Yahoo! Labs, Bangalore
Pabitra Mitra (pabitra@cse.iitkgp.ernet.in)
IIT Kharagpur

Overall co-ordinators:
Prasenjit Majumder (p_majumder@daiict.ac.in)
DAIICT, Gandhinagar
Mandar Mitra (mandar@isical.ac.in)
ISI Kolkata

International Advisory Committee for FIRE:
==========================================
Amit Singhal, Google Fellow, USA
Carol Peters, ISTI-CNR, Italy
Christian Fluhr, CEA, France
Donna Harman, National Institute of Standards and Technology, USA
Doug Oard, University of Maryland, USA
Ee Peng Lim, Nanyang Technological University, Singapore
Ellen Voorhees, National Institute of Standards and Technology, USA
Fabrizio Sebastiani, ISTI-CNR, Italy
Gareth Jones, Dublin City University, Ireland.
Hsin-Hsi Chen, National Taiwan University, Taipei, Taiwan
Hwee Tou Ng, National University of Singapore, Singapore
Iadh Ounis, University of Glasgow, UK
Ian Soboroff, National Institute of Standards and Technology, USA
Jacques Savoy, University of Neuchatel, Switzerland
James Allan, University of Massachusetts Amherst, USA
Krishna Kummamuru, IBM Research Lab, India
Mark Sanderson, University of Sheffield, UK
Mun Kew Leong, Institute for Infocomm Research, Singapore
Norbert Fuhr, University of Duisburg, Germany
Noriko Kando, National Institute of Informatics, Japan
Paul McNamee, Johns Hopkins University, USA
Prabhakar Raghavan, Yahoo! Research Labs, USA
Ricardo Baeza-Yates, Yahoo! Research Labs, Spain
Stephen Robertson, Microsoft Research, Cambridge, UK
Sung Hyon Myaeng, KAIST, South Korea
Tat-Seng Chua, National University of Singapore, Singapore
Tetsuya Sakai, Microsoft Research Asia, Beijing

IR Videos in Spanish

22 Monday Jun 2009

Posted by egarcia in Conferences, IR Tutorials, Latent Semantic Indexing

≈ Leave a Comment

I normally do not put online my lecture notes (ppt, pdf, videos). However, there are two public conferences that event organizers taped. Both last over 1 hour and are in Spanish, but with slides in English. Here are the links. The quality of the videos is so-so.

Since the videos were made available few months later after the events, these are not properly dated. I have included below the actual date of the events. If you don’t know Spanish, you are out of luck.

1. Understanding Search Engines (Entendiendo a los Buscadores), University of Puerto Rico, Bayamon, 4-23-2008

http://video.google.com/videoplay?docid=-653964730907023811

This one last for about two hours. The audience consisted of grad students and researchers. Unfortunately, the video has an audio-visual mismatch of about one slide. If you can coupe with this, I hope you like it.

2. Demystifying LSI (Desmitificando LSI)- OJOBuscador Congress, Madrid, Spain, 3-09-2007.

http://www.ojotube.com/videos/congreso-ojobuscador-2007-ponencia-desmitificando-lsi-de-dr-e-garcia/

This one last for over one hour. Since it was for a non-scientific audience  (most Spanish SEOs)  I tried to talk very slow.

W3C 2009 Conference

26 Thursday Mar 2009

Posted by egarcia in Conferences

≈ Leave a Comment

Here is the final list conforming the 18 International Conference of the W3C, WWW2009, of which AIRWeb2009 is a workshop.

http://www.webshine.org/2009reg.html

A lot of good stuff to please IRs, CS students, spammers/SEOs, and hackers.

SIDIM XXIV Conference

05 Thursday Mar 2009

Posted by egarcia in Conferences, Data Mining, Homeland Security, Queries, Vector Space Models

≈ 4 Comments

I am presenting at The Seminario Interuniversitario de Investigación en Ciencias Matemáticas (Interuniversity Seminar on Mathematical Sciences Research, SIDIM).

This is one of the most important activities held in Puerto Rico for the promotion of Mathematics research. (http://sidim2009.uprr.pr/)

This year SIDIM will be held at University of Puerto Rico, Rio Piedras in March 6-7, 2009. The SIDIM program and book of abstracts  is available at http://sidim.uprh.edu/libroSIDIM2009.pdf

I will be presenting new research work on IDF and a new model for the conditional specificity of terms. If you have followed previous posts on the topic of inverse document frequency, now you will understand why I have dissected the topic several times. Thank you all for your private comments and feedback on the topic.

My abstract follows:

Scaled Inverse Document Frequency: A Model for the Evaluation of the Conditional Specificity of Query Terms in Search Engine Collections

Edel Garcia, Internet Business Development Center, Interamerican University of Puerto Rico, Metropolitan Campus

Inverse document frequency (IDF) is a measure of the specificity of query terms over a collection of D number of documents that has been successfully incorporated into numerous vector space information retrieval models. Since these models assume term independence, the specificity of a given term, present in different queries, is assumed to be unique and independent from other query terms. To the best of our knowledge, there are no known models that condition the specificity of terms to the presence of other terms in a query.

This paper proposes a new measure called scaled inverse document frequency (SIDF) which evaluates the conditional specificity of query terms over a subset S of D and without making any assumption about term independence. S can be estimated from search results, OR searches, or computed from inverted index data. We have evaluated SIDF values from commercial search engines by submitting queries relevant to the financial investment domain. Results compare favorably across search engines and queries. Our approach has practical applications for `real-world’ scenarios like in Web Mining, Homeland Security, and keyword-driven marketing research scenarios. SIDF can be incorporated into a variety of information retrieval models as a global weight scoring system.

Keywords: inverse document frequency, conditional term specificity, web mining, search engines

AND 2009 Conference

17 Tuesday Feb 2009

Posted by egarcia in Conferences

≈ Leave a Comment

L. Venkata Subramaniam, PhD, Manager – Information Processing and Analytics, IBM India Research Lab http://lvs004.googlepages.com sent us email with some great news. They are having the Third Workshop on Analytics for Noisy Unstructured Text Data (AND) on July 23-24, 2009, at Barcelona, Spain and asked us to disseminate the news.

Copy of the email follows.

——————–

Dear Edel,

We are organizing the Third Workshop on Analytics for Noisy Unstructured Text Data on July 23-24, 2009, at Barcelona, Spain.

We know you work in related areas and would be happy to have you submit your research work to this workshop.

Also I request you to add AND 2009 to the blog you are maintaining at: http://irthoughts.wordpress.com/

I know many IR researchers visit your blog and through the blog we will be able to reach.

AND 2009: http://and2009workshop.googlepages.com/

This is the third in the series of workshops: AND 2007 at IJCAI 2007: http://research.ihost.com/and2007/

AND 2008 at SIGIR 2008:
http://and2008workshop.googlepages.com/

Both earlier workshops resulted in ACM proceedings and journal special issues. Here are some details of AND 09:
http://and2009workshop.googlepages.com/

Workshop Name: Third Workshop on Analytics for Noisy Unstructured Text Data (AND 09) in conjunction with ICDAR 09

Submission Date: 20 April 2009
Notification Date: 20 May 2009
Workshop Dates: 23-24 July 2009 Workshop
Location: Barcelona Spain

Regards
Venkat

————————-

So now you know. Start making plans for attending this great workshop. If you are visiting Madrid, swing by to Barcelona for a few days. Don’t miss this unique opportunity.

AIRWeb2009 Call for Papers

08 Thursday Jan 2009

Posted by egarcia in Conferences, Spam

≈ Leave a Comment

Last night I received the following email from the organizing committee of AIRWeb2009 asking to disseminate the event:

Dear Edel,

Thank you again for agreeing to serve on the AIRWeb program committee. We have attached the AIRWeb CFP to this message and would appreciate your assistance in publicizing the workshop. The CFP is also available from the AIRWeb website – http://airweb.cse.lehigh.edu/2009/ .

Best regards,
Dennis Fetterly and Zoltan Gyongyi

This is my third year as a PC Member of AIRWeb. It is a lot of fun reviewing manuscripts to be presented at the event, months before the new anti-spamdexing and anti-adversarial IR practices are disseminated to the general public. Some, spammers like to wait and follow what comes out of AIRWeb to then try workarounds. This is a continuos arm race and cat-mouse chase.

So, this post goes as follows:

AIRWeb is a series of international workshops focusing on Adversarial Information Retrieval on the Web that brings together both researchers and industry practitioners, to present and discuss advances in the state of the art.

AIRWeb’09 will be co-located with the WWW2009 conference in Madrid, Spain. The workshop proceedings will be made available in the ACM Digital Library.

Important Dates

6 February 2009: Deadline (optional, but helpful) for abstract submissions

13 February 2009: Deadline for paper submissions
20 or 21

April 2009: Date of the workshop

Incidentally, I am observing another new wave of spammers, marketers, and johnny-comes-late talking in “IR tongues” to gain some credibility from easy to impress folks. Ironically their audience mostly consists of their peer SEOs. I guess the fight against spammers disguised as marketers never ends.

I wish I can beat the crap out of all these self-proclaimed SEO “experts” every single day through this blog. Fortunately, I have better things to do like conducting research, advicing students, writing IRW The Newsletter, preparing a paper for SIDIM 2009, peer reviewing IR manuscripts, and (reality checks-to-pay-bills) taking on enterprise projects.

← Older posts
May 2013
M T W T F S S
« Apr    
 12345
6789101112
13141516171819
20212223242526
2728293031  

Favorite Sites

  • Mi Islita

Pages

  • About IR Thoughts

Categories

  • AIRWeb Course
  • Conferences
  • Data Mining
  • Dynamics
  • Fractal Geometry
  • Graduate Courses
  • Hacking
  • Homeland Security
  • Human-Computer Interaction
  • Image Compression
  • Internet Engineering
  • IR Quizzes
  • IR Tools
  • IR Tutorials
  • Latent Semantic Indexing
  • Legacy Posts
  • Machine Learning
  • Marketing Research
  • Miscellaneous
  • News
  • Newsletters
  • Programming
  • Quack Science
  • Queries
  • Scripts
  • Search Engines Architecture Course
  • SEO Myths
  • Software
  • Spam
  • Statistics and Mathematics
  • Theses
  • Vector Space Models
  • Web Mining Course

Recent Posts

  • “Powered by” in Spanish
  • Some nice features added to the Image Crawler
  • The Images Crawler
  • A nice service for my locals
  • An update to the Web Crawler
  • New similarity measures
  • The Web Crawler is Back!
  • Tracking Users: An Email Crawler on Steroids
  • The Email Crawler: A Tool for Gathering Emails
  • The Binary Distance Calculator – a tool for comparing binary sets
  • Fractalettes: A Fractal Design Strategy to Color Mining and Learning through Discovery
  • AZZOO and WAZZOO: New Similarity Measures for the 21st Century
  • The Binary Similarity Calculator
  • From Harlem Shake to Link Shake: The Qualified Links Shake
  • Web Vulnerabilities and Search Engines

Archives

  • May 2013
  • April 2013
  • March 2013
  • February 2013
  • January 2013
  • December 2012
  • November 2012
  • October 2012
  • September 2012
  • August 2012
  • July 2012
  • June 2012
  • May 2012
  • April 2012
  • March 2012
  • February 2012
  • January 2012
  • December 2011
  • November 2011
  • October 2011
  • September 2011
  • August 2011
  • July 2011
  • June 2011
  • May 2011
  • April 2011
  • February 2011
  • January 2011
  • December 2010
  • November 2010
  • October 2010
  • September 2010
  • August 2010
  • July 2010
  • June 2010
  • May 2010
  • April 2010
  • March 2010
  • February 2010
  • January 2010
  • December 2009
  • November 2009
  • October 2009
  • September 2009
  • August 2009
  • July 2009
  • June 2009
  • May 2009
  • April 2009
  • March 2009
  • February 2009
  • January 2009
  • December 2008
  • November 2008
  • October 2008
  • September 2008
  • August 2008
  • July 2008
  • June 2008
  • May 2008
  • April 2008
  • March 2008
  • February 2008
  • January 2008
  • December 2007
  • November 2007
  • October 2007
  • September 2007
  • August 2007
  • July 2007
  • June 2007
  • May 2007
  • April 2007

AIRWeb Course Conferences Data Mining Fractal Geometry Graduate Courses Hacking Homeland Security Human-Computer Interaction Internet Engineering IR Quizzes IR Tools IR Tutorials Latent Semantic Indexing Legacy Posts Machine Learning Marketing Research Miscellaneous Newsletters Programming Quack Science Queries Scripts Search Engines Architecture Course SEO Myths Software Spam Statistics and Mathematics Theses Vector Space Models Web Mining Course

Blog at WordPress.com. Theme: Chateau by Ignacio Ricci.