>> Über uns > Homepages > Christian Gütl > Courses > 506.418 Informa[..] > Allgemeines > Link List - Inf[..]

Information Search and Retrieval
LV-Nr. 506.418, Vorlesung-Übung
Stunden/Woche 3 SE


Vortragender: Dipl.-Ing. Dr.techn. Christian Gütl


IR Link List


Online Ressourcen Learning Material
Journals Online Courseware
Conferences Experimenting and Simulations
Online Books Software, Tools and  Research
Link Lists on IR Tools

Software

Research


Online Ressourcen

Organizations

Special Interest Group on Information Retrieval (SIGIR)
ACM Interest Group
http://www.acm.org/sigir/


The Information Retrieval Group
Department of Computing Science, University of Glasgow
http://ir.dcs.gla.ac.uk/

The Center for Intelligent Information Retrieval
University of Massachusetts, Amherst
http://ciir.cs.umass.edu/


Journals

Information Retrieval
The essential forum for theory and experimentation in Information Retrieval and its Applications
Kluwer Academic Publishers
http://www.kluweronline.com/issn/1386-4564

SIGIR Forum
http://www.acm.org/sigir/forum/index.html


Conferences

The International Conferences on Music Information Retrieval and Related Activities (ISMIR)
http://ismir2002.ircam.fr/

Text REtrievla Converence (TREC)
http://trec.nist.gov/

further information
http://www.acm.org/sigir/


Online Books

INFORMATION RETRIEVAL
A book by C. J. van RIJSBERGEN
http://www.dcs.gla.ac.uk/Keith/Preface.html


Link Lists on IR

Information Retrieval Research
SearchTools.com: Background Topics
http://www.searchtools.com/info/info-retrieval.html

Natural Language Processing in Information Retrieval Research
SearchTools.com: Background Topics
http://www.searchtools.com/info/ir-nlp.html

Resources for Text, Speech and Language Processing
http://www.cs.technion.ac.il/~gabr/resources/pointers.html

Information Filtering Resources
http://www.ee.umd.edu/medlab/filter/filter.html

Mixed Information Retrieval
http://wwwhome.cs.utwente.nl/~hiemstra/links.html


Information Retrieval
dmoz.org
http://dmoz.org/Computers/Software/Information_Retrieval/

Open Source Search Engines
http://www.searchtools.com/tools/tools-opensource.html

SourceForge.net: Software Map
Indexing/Search
http://sourceforge.net/softwaremap/trove_list.php?form_cat=93


Learning Material

Online Courseware

Intelligent Information Retrieval and Web Search
by Raymond J. Mooney, University of Texas, Austin, USA
scope: basic IR, advanced techniques in IR, web search
http://www.cs.utexas.edu/users/mooney/ir-course/

Course Material on Information Retrieval
Norbert Fuhr: Information Retrieval, Informatik.Uni-Duisburg
http://www.is.informatik.uni-duisburg.de/teaching/lectures/ir_ss03/folien/irskall.pdf
http://www.is.informatik.uni-duisburg.de/teaching/lectures/ir_ss04/

Course on Information Retrieval
James Allan, Toni Rath, Center for Intelligent Information Retrieval (CIIR), University of Massachusetts, Amherst
http://ciir.cs.umass.edu/cmpsci646/

Information Storage and Retrieval
Peter Brusilovsky, University of Pittsburg
http://www2.sis.pitt.edu/~peterb/2140-002/materials.html

Information Retreival
Simone Teufel, University of Cambridge Computer Laboratory
internal rating: ***
http://www.cl.cam.ac.uk/users/sht25/IRIE/

Information Retrieval (IR)
Dadabase Group, Stanford University
http://www-db.stanford.edu/cs347.2001.spring/course-info.html

Information Retrieval
K. Spärck Jones, Computer Lab, University of Cambridge
http://www.cl.cam.ac.uk/DeptInfo/CST99/node73.html

Information Retrieval
Karin Haenelt, Universität Heidelberg
http://kontext.fraunhofer.de/haenelt/kurs/InfoRet/kurs.html

Principles of Information Retrieval
Ray R. Larson, School of Information Management and Systems, UC Berkeley
http://www.sims.berkeley.edu/academics/courses/is240/s02/


Experimenting and Simulations

Algorithm Animations
http://www2.sis.pitt.edu/~peterb/2140-002/links.html

Natural Language Toolkit
http://nltk.sourceforge.net/

IR Java Classes
http://www.cs.utexas.edu/users/mooney/ir-course/ir.jar
http://www.cs.utexas.edu/users/mooney/ir-course/doc/index.html


Software, Tools and  Research

Tools

Natural Language Toolkit
NLTK, the Natural Language Toolkit, is a suite of Python libraries and programs for symbolic and statistical natural language processing. NLTK includes graphical demonstrations and sample data. It is accompanied by extensive documentation, including tutorials that explain the underlying concepts behind the language processing tasks supported by the toolkit. NLTK is ideally suited to students who are learning NLP (natural language processing) or conducting research in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning. NLTK has been used successfully as a teaching tool, as an individual study tool, and as a platform for prototyping and building research systems.
http://nltk.sourceforge.net/

The Lemur Toolkit for Language Modeling and Information Retrieval
Toolkit written in C and C++
http://www-2.cs.cmu.edu/~lemur/

IR Java Classes
http://www.cs.utexas.edu/users/mooney/ir-course/ir.jar
http://www.cs.utexas.edu/users/mooney/ir-course/doc/index.html


Software

Jakarta Lucene
Jakarta Lucene is a high-performance, full-featured text search engine written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
http://jakarta.apache.org/lucene/docs/index.html

SWISH-E
SWISH-E is a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other files.
http://swish-e.org/

Zebra
Zebra is a high-performance, general-purpose structured text indexing and retrieval engine. It reads structured records in a variety of input formats (eg. email, XML, MARC) and allows access to them through exact boolean search expressions and relevance-ranked free-text queries.
http://www.indexdata.dk/zebra/

WebGlimps
Webglimpse has two parts : Glimpse, the fast C engine which does the text indexing and pattern matching, and Webglimpse proper, the flexible Perl spider, archive manager and user interface script.
http://glimpse.cs.arizona.edu/index.php?dir=subdocs&page=overview.html

ht://Dig
Website Search Engine written in C++.
http://www.htdig.org/

Nutch
Open Source Search Engine, implemented in pure Java
http://www.nutch.org/

Grub
distributet Web Crawling
http://www.grub.org/

Amberfish
The distinguishing features of Amberfish are indexing/search of semi-structured text (i.e. both free text and multiply nested fields), built-in support for XML documents using the Xerces library, structured queries allowing generalized field/tag paths, hierarchical result sets (XML only), automatic searching across multiple databases (allowing modular indexing), and relatively low memory requirements during indexing (and the ability to index documents larger than available memory). Other features include standard Boolean queries, right truncation, phrase searching, relevance ranking, support for multiple documents per file, and easy integration with other UNIX tools.
http://www.etymon.com/tr.html

Isearch
Isearch is open source text retrieval software developed in 1994 by Nassib Nassar at the Clearinghouse for Networked Information Discovery and Retrieval (CNIDR). The main features of Isearch include full text and field searching, relevance ranking, Boolean queries, and support for many document types such as HTML, mail folders, list digests, and text with SGML-style tags.
http://www.etymon.com/tr.html

xFIND
Web search system in distributed architecture, written in Java, developed at the IICM.
http://xfind.iicm.edu


Research

GridIR
is an architecture and specification for information retrieval in the context of grid computing. Its purpose is to enable distributed, dynamic information systems to be created and searched securely.
http://www.gridir.org/

OpenNLP
is an organizational center for open source projects related to natural language processing.
http://opennlp.sourceforge.net/



author: Christian Gütl
email: cguetl@iicm.edu
last update: 2010-09-22