Jump to content

Index term

From Wikipedia, the free encyclopedia

Ininformation retrieval,anindex term(also known assubject term,subject heading,descriptor,orkeyword) is a term that captures the essence of the topic of a document. Index terms make up acontrolled vocabularyfor use inbibliographic records.They are an integral part ofbibliographic control,which is the function by which libraries collect, organize and disseminate documents. They are used as keywords to retrieve documents in an information system, for instance, a catalog or asearch engine.A popular form of keywords on the web aretags,which are directly visible and can be assigned by non-experts. Index terms can consist of a word, phrase, or Alpha numerical term. They are created by analyzing the document either manually withsubject inde xingor automatically withautomatic inde xingor more sophisticated methods of keyword extraction. Index terms can either come from a controlled vocabulary or be freely assigned.

Keywords are stored in asearch index.Common words likearticles(a, an, the) and conjunctions (and, or, but) are not treated as keywords because it's inefficient. Almost every English-language site on the Internet has the article "the",and so it makes no sense to search for it. The most popular search engine,Googleremovedstop wordssuch as "the" and "a" from its indexes for several years, but then re-introduced them, making certain types of precise search possible again.

The term "descriptor" was byCalvin Mooersin 1948. It is in particular used about a preferred term from athesaurus.

TheSimple Knowledge Organization Systemlanguage (SKOS) provides a way to express index terms withResource Description Frameworkfor use in the context of theSemantic Web.[1]

In web search engines

[edit]

Mostweb search enginesare designed to search for words anywhere in a document—the title, the body, and so on. This being the case, a keyword can be any term that exists within the document. However, priority is given to words that occur in the title, words that recur numerous times, and words that are explicitly assigned as keywords within the coding.[2]Index terms can be further refined usingBoolean operatorssuch as "AND, OR, NOT." "AND" is normally unnecessary as most search engines infer it. "OR" will search for results with one search term or another or both. "NOT" eliminates a word or phrase from the search, getting rid of any results that include it. Multiple words can also be enclosed in quotation marks to turn the individual index terms into a specific indexphrase.These modifiers and methods all help to refine search terms, to better maximize the accuracy of search results.[3]

Author keywords

[edit]

Author keywords are an integral part of literature.[1]Many journals and databases provide access to index terms made by authors of the respective articles. How qualified the provider is decides the quality of both indexer-provided index terms and author-provided index terms. The quality of these two types of index terms is of research interest, particularly in relation toinformation retrieval.In general, an author will have difficulty providing inde xing terms that characterize his or her documentrelativeto other documents in the database.

Examples

[edit]

See also

[edit]

References

[edit]
  1. ^abSvenonius, Elaine(2009).The intellectual foundation of information organization(1st MIT Press pbk. ed.). Cambridge, Massachusetts: MIT Press.ISBN9780262512619.
  2. ^Cutts, Matt. (2010, March 4).How search works.Retrieved fromhttps:// youtube /watch?v=BNHR6IQJGZs
  3. ^CLIO.Keyword search.Columbia University Libraries. Retrieved fromhttp:// columbia.edu/cu/lweb/help/clio/keyword.html

Further reading

[edit]