Today’s Internet Engineering Part 1 course lecture will be on DNS Intelligence and how we can use DNS records to understand virus and worm attacks as well as remote network topologies. Quite handy these days.
Please check Lecture 8
13 Tuesday Oct 2009
Posted in Graduate Courses, Internet Engineering
Today’s Internet Engineering Part 1 course lecture will be on DNS Intelligence and how we can use DNS records to understand virus and worm attacks as well as remote network topologies. Quite handy these days.
Please check Lecture 8
06 Tuesday Oct 2009
Posted in Graduate Courses, Internet Engineering, Spam
If you are enrolled in the IE-Part 1 course, here is some reference material on Email Headers for today’s lecture:
Exposing email headers
http://www.abs-comptech.com/EmailHeaders.htm
Tracking the source of email spam
http://www.rahul.net/falk/mailtrack.html
How to read email headers
http://www.emailaddressmanager.com/tips/header.html
Reading the email header
http://antivirus.about.com/od/windowsbasics/a/emailheaders.htm
Reading email headers
http://www.tinhat.com/email/read_email_headers.html
Spamlinks: Reading email headers
http://spamlinks.net/track-trace-headers.htm
ACCC: Reading Email Headers
http://www.uic.edu/depts/accc/newsletter/adn29/headers.html
E-mail Headers and SMTP Commands
http://www.avolio.com/columns/E-mailheaders.html
All About Email Headers
http://www.stopspam.org/index.php?option=com_content&view=article&id=45&Itemid=56
Security Optimization Strategies in the Workplace
http://www.miislita.com/searchito/security-optimization-strategies.html
05 Monday Oct 2009
Posted in Graduate Courses, Internet Engineering
If you are a student enrolled in the Internet Engineering I graduate course, check the Lecture 7 update.
We will be covering email protocols such as SMTP, POP3, and IMAP. The exercise section covers email headers intelligence and email crawlers.
28 Monday Sep 2009
Posted in Graduate Courses, Internet Engineering
If you are a student enrolled in the Internet Engineering I graduate course, check the Lecture 6 update.
I will be covering all about DNS configuration files. For the hands-on exercise section, we will be using nslookup commands to snoop at all relevant records of remote Web domains.
Use nslookup/? to access the options helper
Use nslookup followed by ? in a different line to access the commands helper
To quit nslookup, press ctrl C or either type quit or exit.
21 Monday Sep 2009
Posted in Graduate Courses, Hacking, Internet Engineering
The following are the lecture and exercise topics covered in the PUPR.edu core graduate course Internet Engineering, Part I. Students enrolled in the course might want to revisit this post as it will be updated.
Lecture 0
History of the Internet & Search Engines
Internet Basics
Lecture 1
RFCs (Request for Comments)
Network Types
IP (Internet Protocol)
Exercise 1 – RFCs, Network types, IP calculations
Lecture 2
OSI Reference Model
ARP
ICMP
Exercise 2 – IP-MAC Mapping, Prompt Commands (arp, ipconfig, nslookup)
Lecture 3
Man-in-the-Middle ARP Attacks
IGMP
IP Packets
Exercise 3 – Broadcast & Multicast IPs, Prompt Commands (netstat, ping, tracert, ipconfig, arp, nslookup)
Lecture 4
Fragmentation Offset
FO Overlapping Attacks
FO Gap Attacks
Tiny FO Attacks
TCP Protocol & Buffers
Exercise 4 – TCP buffers, Congestion Windows, Advertised Windows
Lecture 5
PING
PING of Death
Smurfing
TRACEROUTE-based Intelligence
Exercise 5 – Prompt Commands (arp, ipconfig, nslookup, netstat, ping, tracert)
Lecture 6
BIND & WINDOWS DNS (Domain Name Server)
Internet backbone root servers
Configuration Files
DNS Configuration Errors
Forward Lookup (Zone) Files
Reverse Lookup Files
Exercise 6 – Prompt Commands (interactive/non-interactive nslookup modes)
Lecture 7
SMTP
POP3
IMAP
Email Headers
Exercise 7 – Email Intelligence.
Lecture 8
DNS Intelligence
Using DNS records to understand Virus & Worm Attacks
Network Topology Intelligence from DNS records
Exercise 8 – DNS Intelligence
Lecture 9
General Review
Practice Test
Lecture 10
Final Exam, Oct 27
Course Grading System
8 out of 9 hands-on exercises count (worse exercise grade dropped)
1st partial exam = average of first best 4 exercise grades
2nd partial exam = average of last best 4 exercise grades
The average of these two is the same as adding up best 8 grades and dividing by 8. This result amounts to 75% of total grade (course letter grade score).
Final Exam amounts to 25 % of total grade.
After that, course letter grade is curved as shown below.
A (100-89%)
B (88-77%)
C (76-60%)
D (59-50%)
F (49-0%)
where
course letter grade score = (sum of best 8 exercise grades/8)*(0.75) + (final exam grade)*(0.25)
01 Tuesday Sep 2009
Posted in Graduate Courses, Hacking, Spam
As PUPR students know by now, the AIRWeb and Internet Engineering courses have been consolidated into a single course called Internet Engineering I (IE-I), which is on Tuesday’s.
This was a decision made strictly by the administration. 12 graduate students are enrolled –a big number for a grad course. We are now in the fourth week of IE-I and I can tell that is a lot of fun.
This coming Winter semester I’m scheduled to teach a new grad course called Advanced Search Engine Architecture (ASEA). Both, IE-I and ASEA are hands-on. This means students need to get their hands and feet wet, not just learning the theory.
What we are trying to accomplish in IE-I is to understand how hackers and spammers use Internet architectures at the level of TCP/IP and Search Engines to game the system. I’ll open a special blog category for it during the week.
First lecture (Lecture 1) was briefly summarized in the August 2009 issue of IR Watch. BTW. Tonight’s lecture (Lecture 4) covers the following:
IP Protocol (MAC and IP Mapping)
ICMP Protocol
ARP Hacking Attacks
ICMP Hacking Attacks
Firewall’s Fragmentation Offset Attacks
Meanwhile, ASEA is an expanded version of the previous Search Engine Architecture (SEA) course I’ve taught before. Students interested in registering, can search this blog for the SEA category and check what we have covered in the past. This will give them an idea of what to expect from the Advanced SEA course. One thing I’m planning to do different is to build an inverted index from scratch using AJAX. The most recent version of Terrier will also be used for testing/benchmarking experimentals.
Last but not least, September Issue of IRW will be a bit delayed.
02 Thursday Apr 2009
Posted in AIRWeb Course, Graduate Courses
During the Fall of 2009, I will be teaching
Adversarial Information Retrieval on the Web: A Graduate Course on Web Spam and Internet Vulnerabilities
This a new one-full semester graduate course to be offered at Polytechnic University Puerto Rico. It is based on the material presented at the annual AIRWeb Workshops. KDDM graduate students are encouraged to enroll. An early announcement and preliminary syllabus is available at
http://www.miislita.com/courses/airweb-web-spam-syllabus.pdf
BTW, In November 5 of 2008 PUPR became the First Academic Institution in the Caribbean to be Certified by the Committee on National Security Systems (CNSS). Additional information is available at http://www.pupr.edu/ias.html
Their goal is to become a Center of Academic Excellence in Information Assurance Education (CAE/IAE). These are great news. Nationwide, how many universities you know that are in such an exclusive ”club”?
01 Friday Aug 2008
Posted in Graduate Courses, Machine Learning, Marketing Research, Theses
The current issue of IRW, Graduate Students Research, is out. It consists of short abstracts of research conducted by graduate students.
In this issue:
Introduction
Genetic Algorithms, K-Means, and Fuzzy C-Means
Word Association Patterns
U-Site Search Engine Interface
Enhancement of a U-Site Search Engine Interface
News, Research, and Events
Terms of Use and Copyright
The next issue will go back to its how-to mode.
18 Wednesday Jun 2008
Posted in Graduate Courses, IR Tutorials, Machine Learning
Here is a question I included during the final examination of the Search Engines Architecture course. I am modifying the question. It might serve as a little quiz for non IR readers:
A collection consists of 500 documents. Some documents mention k1 and/or k2 keywords. If 100 mention k1, 200 mention k2, 70 mention k1 and k2, and 25 mention the k1 k2 terms sequence. Calculate the number of results for the following queries first, assuming terms independence and second assuming terms dependence. If the calculation is not possible from the provided data, write NC, ‘Not Computable’.
1. k1 NOT k2
2. k2 NOT k1
3. k1 OR k2 (unconditional OR)
4. k1 OR k2 (conditional OR)
5. NOT k1
6. NOT k2
7. NOT (k1 AND k2)
8. k1 AND k2 NOT (k1 k2)
9. EF-Ratio of the k1 k2 terms sequence
10. c12-index of the k1 k2 terms sequence
11. c12-index of k1 AND k2
12. IDF of k1
13. IDF of k2
14. IDF of k1 AND k2
15. IDF of k1 k2 terms sequence
Total Possible Scores: 15 points for terms independence and 15 points for terms dependence correct results.
Grading Yourself: A (100 – 90), B (89 – 80), C (79 – 70), D (69 -60), F(59 – 0)
Correct answers will be given during the week.
16 Friday May 2008
Week 10 Agenda
Lecture Session
Other Inverted Index Architectures
Divide-and-Conquer Strategies for Fast Indexing and Searching
Lab Session
Lectures and Lab Review
Final Examination Notes
Next week we have the final examination. This is an open book exam, with theory and practice sections.
To answer the test you need:
#2 pencil.
Calculator.
Working version of Terrier.
Tools developed during the course: parser, crawler, url and query normalizers, stemmer, etc.
Laptop (or a PC will be supplied to you).