Information Retrieval Services Directories
Directories
Directories are also known as Web catalogs and are compiled and maintained by human editors and researchers.
Another characteristic of a directory is a list of categories in hierarchical order (most broad to most specific).
You can "search" by clicking on a category and making selections from categories that are increasingly specific.
Some examples of directories are:
Directories
Directories carry weight and can hurt your rankings.
This section will mainly focus on general directories, but the end of this section does include a few tips on how to find niche directories
(or even other general directories) and how to determine if they are worth getting a listing on.
Yahoo
The Yahoo Directory was started in 1994 under the name "Jerry and David's Guide to the World Wide Web" but in 1996 became Yahoo.
At the time Yahoo was primarily a directory with search functionality and (interestingly) neither SEO nor Internet Marketing were even categories at the time.
Through the late 1990s Yahoo pushed to become a web portal and in 2000 even signed a deal with Google that would see Google power Yahoo's search functionality. Their focus at the time was to acquire users through acquisitions such as GeoCities,
bringing more people into their portal and keeping them there. Unfortunately Yahoo did not have the same user loyalty that Apple does and the
walled-garden approached failed as users Googled their way out of the Yahoo network of sites.
- About.com
- Yahoo!
Directories are discussed in more detail in the next module. For now, take a quick look at the main page (or home page) of each directory and note the number and type of categories that each offers.
Question:
Do any seem more inviting to use?
Is one directory organized in a manner that makes more sense to you?
Or do they seem visually identical to each other?
Clicking on any of these links will open the Web site in a separate browser window, so you can switch between the lesson and the website.
Automated IR
Automated information retrieval (IR) systems were originally developed to help manage the huge scientific
literature that has developed since the 1940s. Many university and public libraries now use IR systems to
provide access to books, journals, and other documents.
Commercial IR systems offer databases containing millions of documents in myriad subject areas.
Dictionary and encyclopedia databases are now widely available for PCs.
Information Retrieval has been found useful in such disparate areas as office automation and software engineering.
Indeed, any discipline that relies on documents to do its work could potentially use and benefit from IR.
An IR system matches user queries to documents stored in a database.
A document is a data object, usually textual, though it may also contain other types of data such as photographs, graphs.
Often, the documents themselves are not stored directly in the IR system, but are represented in the system by document
surrogates. This web page is a document and could be stored in its entirety in an IR database.
One might instead, however, choose to create a document surrogate for it consisting of the title, author, and abstract.
This is typically done for efficiency to reduce the size of the database and searching time.
An IR system must support certain basic operations. There must be a way to enter documents into a database,
change the documents, and delete them. There must also be some way to search for documents, and present them to
a user. IR systems vary greatly in the ways they accomplish these tasks.