Information Search and Retrieval
Course no. 506.418, lecture with exercises, hours/week: 3 SE
Lecturer: Dipl.-Ing. Dr.techn. Christian Gütl

IR Link List
Online Resources

Organizations

Special Interest Group on Information Retrieval (SIGIR)
ACM Interest Group
http://www.acm.org/sigir/

The Information Retrieval Group
Department of Computing Science, University of Glasgow
http://ir.dcs.gla.ac.uk/

The Center for Intelligent Information Retrieval
University of Massachusetts, Amherst
http://ciir.cs.umass.edu/

Journals

Information Retrieval
The essential forum for theory and experimentation in Information Retrieval and its Applications
Kluwer Academic Publishers
http://www.kluweronline.com/issn/1386-4564

SIGIR Forum
http://www.acm.org/sigir/forum/index.html

Conferences

The International Conferences on Music Information Retrieval and Related Activities (ISMIR)
http://ismir2002.ircam.fr/

Text REtrieval Conference (TREC)
http://trec.nist.gov/

Further information
http://www.acm.org/sigir/

Online Books

Information Retrieval
A book by C. J. van Rijsbergen
http://www.dcs.gla.ac.uk/Keith/Preface.html

Link Lists on IR

Information Retrieval Research
SearchTools.com: Background Topics
http://www.searchtools.com/info/info-retrieval.html

Natural Language Processing in Information Retrieval Research
SearchTools.com: Background Topics
http://www.searchtools.com/info/ir-nlp.html

Resources for Text, Speech and Language Processing
http://www.cs.technion.ac.il/~gabr/resources/pointers.html

Information Filtering Resources
http://www.ee.umd.edu/medlab/filter/filter.html

Mixed Information Retrieval
http://wwwhome.cs.utwente.nl/~hiemstra/links.html

Information Retrieval
dmoz.org
http://dmoz.org/Computers/Software/Information_Retrieval/

Open Source Search Engines
http://www.searchtools.com/tools/tools-opensource.html

SourceForge.net: Software Map
Indexing/Search
http://sourceforge.net/softwaremap/trove_list.php?form_cat=93

Learning Material

Online Courseware

Intelligent Information Retrieval and Web Search by Raymond J.
Mooney, University of Texas, Austin, USA
scope: basic IR, advanced techniques in IR, web search
http://www.cs.utexas.edu/users/mooney/ir-course/

Course Material on Information Retrieval
Norbert Fuhr: Information Retrieval, Informatik.Uni-Duisburg
http://www.is.informatik.uni-duisburg.de/teaching/lectures/ir_ss03/folien/irskall.pdf
http://www.is.informatik.uni-duisburg.de/teaching/lectures/ir_ss04/

Course on Information Retrieval
James Allan, Toni Rath, Center for Intelligent Information Retrieval (CIIR), University of Massachusetts, Amherst
http://ciir.cs.umass.edu/cmpsci646/

Information Storage and Retrieval
Peter Brusilovsky, University of Pittsburgh
http://www2.sis.pitt.edu/~peterb/2140-002/materials.html

Information Retrieval
Simone Teufel, University of Cambridge Computer Laboratory
internal rating: ***
http://www.cl.cam.ac.uk/users/sht25/IRIE/

Information Retrieval (IR)
Database Group, Stanford University
http://www-db.stanford.edu/cs347.2001.spring/course-info.html

Information Retrieval
K. Spärck Jones, Computer Lab, University of Cambridge
http://www.cl.cam.ac.uk/DeptInfo/CST99/node73.html

Information Retrieval
Karin Haenelt, Universität Heidelberg
http://kontext.fraunhofer.de/haenelt/kurs/InfoRet/kurs.html

Principles of Information Retrieval
Ray R. Larson, School of Information Management and Systems, UC Berkeley
http://www.sims.berkeley.edu/academics/courses/is240/s02/

Experimenting and Simulations

Algorithm Animations
http://www2.sis.pitt.edu/~peterb/2140-002/links.html

Natural Language Toolkit
http://nltk.sourceforge.net/

IR Java Classes
http://www.cs.utexas.edu/users/mooney/ir-course/ir.jar
http://www.cs.utexas.edu/users/mooney/ir-course/doc/index.html

Software, Tools and Research

Tools

Natural Language Toolkit
NLTK, the Natural Language Toolkit, is a suite of Python libraries and programs for symbolic and statistical natural language processing. NLTK includes graphical demonstrations and sample data.
It is accompanied by extensive documentation, including tutorials that explain the underlying concepts behind the language processing tasks supported by the toolkit. NLTK is ideally suited to students who are learning NLP (natural language processing) or conducting research in NLP or closely related areas, including empirical linguistics, cognitive science, artificial intelligence, information retrieval, and machine learning. NLTK has been used successfully as a teaching tool, as an individual study tool, and as a platform for prototyping and building research systems.
http://nltk.sourceforge.net/

The Lemur Toolkit for Language Modeling and Information Retrieval
Toolkit written in C and C++.
http://www-2.cs.cmu.edu/~lemur/

IR Java Classes
http://www.cs.utexas.edu/users/mooney/ir-course/ir.jar
http://www.cs.utexas.edu/users/mooney/ir-course/doc/index.html

Software

Jakarta Lucene
Jakarta Lucene is a high-performance, full-featured text search engine written entirely in Java. It is a technology suitable for nearly any application that requires full-text search, especially cross-platform.
http://jakarta.apache.org/lucene/docs/index.html

SWISH-E
SWISH-E is a fast, powerful, flexible, free, and easy to use system for indexing collections of Web pages or other files.
http://swish-e.org/

Zebra
Zebra is a high-performance, general-purpose structured text indexing and retrieval engine. It reads structured records in a variety of input formats (e.g. email, XML, MARC) and allows access to them through exact boolean search expressions and relevance-ranked free-text queries.
http://www.indexdata.dk/zebra/

WebGlimpse
WebGlimpse has two parts: Glimpse, the fast C engine which does the text indexing and pattern matching, and WebGlimpse proper, the flexible Perl spider, archive manager and user interface script.
http://glimpse.cs.arizona.edu/index.php?dir=subdocs&page=overview.html

ht://Dig
Website search engine written in C++.
http://www.htdig.org/

Nutch
Open source search engine, implemented in pure Java.
http://www.nutch.org/

Grub
Distributed web crawling.
http://www.grub.org/

Amberfish
The distinguishing features of Amberfish are indexing/search of semi-structured text (i.e. both free text and multiply nested fields), built-in support for XML documents using the Xerces library, structured queries allowing generalized field/tag paths, hierarchical result sets (XML only), automatic searching across multiple databases (allowing modular indexing), and relatively low memory requirements during indexing (and the ability to index documents larger than available memory). Other features include standard Boolean queries, right truncation, phrase searching, relevance ranking, support for multiple documents per file, and easy integration with other UNIX tools.
http://www.etymon.com/tr.html

Isearch
Isearch is open source text retrieval software developed in 1994 by Nassib Nassar at the Clearinghouse for Networked Information Discovery and Retrieval (CNIDR). The main features of Isearch include full text and field searching, relevance ranking, Boolean queries, and support for many document types such as HTML, mail folders, list digests, and text with SGML-style tags.
http://www.etymon.com/tr.html

xFIND
Web search system with a distributed architecture, written in Java, developed at the IICM.
http://xfind.iicm.edu

Research

GridIR
GridIR is an architecture and specification for information retrieval in the context of grid computing. Its purpose is to enable distributed, dynamic information systems to be created and searched securely.
http://www.gridir.org/

OpenNLP
OpenNLP is an organizational center for open source projects related to natural language processing.
http://opennlp.sourceforge.net/

author: Christian Gütl
email: cguetl@iicm.edu
last update: 2010-09-22
