Jump to content

Hilltop algorithm: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
GreenC bot (talk | contribs)
Move 1 url. Wayback Medic 2.5
Corrected long-standing misinformation. See Talk page.
Line 1: Line 1:
{{no footnotes|date=September 2016}}
{{no footnotes|date=September 2016}}
{{primary sources|date=September 2016}}
{{primary sources|date=September 2016}}
The '''Hilltop algorithm''' is an [[algorithm]] used to find documents relevant to a particular keyword topic. Created by [[Krishna Bharat]] while he was at [[DEC Systems Research Center|Compaq Systems Research Center]] and [[George A. Mihăilă]] [[University of Toronto]], it was acquired by [[Google]] in February 2003.
The '''Hilltop algorithm''' is an [[algorithm]] used to find documents relevant to a particular keyword topic in news search. Created by [[Krishna Bharat]] while he was at [[DEC Systems Research Center|Compaq Systems Research Center]] and [[George A. Mihăilă]] [[University of Toronto]], it was acquired by [[Google]] for use in its news results in February 2003.


When you enter a query or keyword into the [[Google Search|Google search engine]], the Hilltop algorithm helps to find relevant keywords whose results are more informative about the query or keyword.
When you enter a query or keyword into the [[Google News|Google news search engine]], the Hilltop algorithm helps to find relevant keywords whose results are more informative about the query or keyword.


The algorithm operates on a special index of ''expert documents''. These are pages that are about a specific topic and have links to many non-affiliated pages on that topic. Pages are defined as non-affiliated if they are authored by people from non-affiliated organizations. Results are ranked based on the match between the query and relevant descriptive text for [[Hyperlink|hyperlinks]] on expert pages pointing to a given result page. Websites which have [[backlinks]] from many of the best expert pages are ''authorities'' and are ranked well.
The algorithm operates on a special index of ''expert documents''. These are pages that are about a specific topic and have links to many non-affiliated pages on that topic. The original algorithm relied on independent directories with categorized links to sites. Results are ranked based on the match between the query and relevant descriptive text for [[Hyperlink|hyperlinks]] on expert pages pointing to a given result page. Websites which have [[backlinks]] from many of the best expert pages are ''authorities'' and are ranked well.


Basically, it looks at the relationship between the "expert" and "authority" pages: an "expert" is a page that links to many other relevant documents; an "authority" is a page that has links pointing to it from the "expert" pages. Here they mean pages about a specific topic with links to many non-affiliated pages on that topic. If a website has [[Backlink|backlinks]] from many of the best expert pages it will be an "authority".
Basically, it looks at the relationship between the "expert" and "authority" pages: an "expert" is a page that links to many other relevant documents; an "authority" is a page that has links pointing to it from the "expert" pages. Here they mean pages about a specific topic with links to many non-affiliated pages on that topic. If a website has [[Backlink|backlinks]] from many of the best expert pages it will be an "authority".

In theory, Google finds "expert" pages and then the pages that they link to would rank well. Pages on sites like [[Yahoo!]], [[DMOZ]], college sites and library sites are usually considered experts.


==See also==
==See also==
Line 19: Line 17:


== External links ==
== External links ==
* {{Crunchbase | krishna-bharat | Krishna Bharat | person }}
* [http://archives.digitaltoday.in/businesstoday/20040606/features6.html Interview with Krishna Bharat] [[Business_Today_(India)|Business Today]], June 6, 2004
* [http://ftp.cs.toronto.edu/pub/reports/csri/405/hilltop.html Hilltop: A Search Engine based on Expert Documents] by K. Bharat and G.A. Mihaila
* [http://ftp.cs.toronto.edu/pub/reports/csri/405/hilltop.html Hilltop: A Search Engine based on Expert Documents] by K. Bharat and G.A. Mihaila
* {{webarchive |url=https://web.archive.org/web/20070401224626/http://www.cs.toronto.edu/~georgem/hilltop/ |date=April 1, 2007 }}
* {{webarchive |url=https://web.archive.org/web/20070401224626/http://www.cs.toronto.edu/~georgem/hilltop/ |date=April 1, 2007 }}

Revision as of 19:33, 22 November 2021

The Hilltop algorithm is an algorithm used to find documents relevant to a particular keyword topic in news search. Created by Krishna Bharat while he was at Compaq Systems Research Center and George A. Mihăilă University of Toronto, it was acquired by Google for use in its news results in February 2003.

When you enter a query or keyword into the Google news search engine, the Hilltop algorithm helps to find relevant keywords whose results are more informative about the query or keyword.

The algorithm operates on a special index of expert documents. These are pages that are about a specific topic and have links to many non-affiliated pages on that topic. The original algorithm relied on independent directories with categorized links to sites. Results are ranked based on the match between the query and relevant descriptive text for hyperlinks on expert pages pointing to a given result page. Websites which have backlinks from many of the best expert pages are authorities and are ranked well.

Basically, it looks at the relationship between the "expert" and "authority" pages: an "expert" is a page that links to many other relevant documents; an "authority" is a page that has links pointing to it from the "expert" pages. Here they mean pages about a specific topic with links to many non-affiliated pages on that topic. If a website has backlinks from many of the best expert pages it will be an "authority".

See also