CS635: Web Search and Mining A
Autumn 2009
Please report any dead links to me.
Administrivia
- Instructor
- Soumen Chakrabarti
- Teaching assistants
- Devshree Sane
- Time and place
- We are in slot 12, Mon, Thu 5--6:25, SIC301 SIC205.
- Moodle
- Visit the Moodle
site for CS635 here.
- Assignments
- Instead of batched homeworks, we will maintain one ever-growing log file of
homework problems, each with its own due date. Problems are to be worked out
in small groups of 2--3 students.
- Calendar
- What topic was covered
when.
Resources
Books in decreasing order of use made in this course:
- Witten, I. H., Moffat, A., and Bell, T. C. (1999). Managing
Gigabytes: Compressing and Indexing Documents and
Images. Morgan Kaufmann.
- Chakrabarti, S. (2002). Mining the
Web: Discovering knowledge from hypertext
data. Morgan Kaufmann.
- Baeza-Yates, R. and Ribeiro-Neto, B. (1999). Modern Information
Retrieval. Pearson Education.
- Manning, C., Raghavan, P., and Schütze, H. (2007). An Introduction to
Information Retrieval. Cambridge University Press.
- Grossman, D. A. and Frieder, O. (1998). Information Retrieval:
Algorithms and Heuristics. Kluwer.
- Croft, B., Metzler, D., and Strohman, T. (2009, to appear). Search Engines:
Information Retrieval in Practice. Pearson Education.
But our main resource will be papers, which will be linked off the
course calendar.
Evaluation
Credit students will write a midterm exam (approximately 30%) and a
final exam (approximately 40%), and do a few assignments
(approximately 30%), typically using Java, Scilab and WEKA.
Audit students write only the final exam. To pass, their final-exam
score must be above the bottom 20 percent of the class, ranked on the
final exam alone.
Eligibility
Mtech1s, DD4s and Btech3s are the primary targets for this
course, but it is open to all PG, DD and Btech4 students of all
departments, subject to department/faculty advisor approval. If you are
a CSE Btech3, ask for my consent by email, citing your CPI and your
grades in Paradigms and Prob/Stat.