HITS

Creator
Creator
Seonglae Cho
Created
Created
2025 Mar 5 23:30
Editor
Edited
Edited
2025 Mar 5 23:38
Refs
Refs

Hyperlink-Induced Topic Search (HITS) Model

Unlike PageRank, HITS is calculated on a per-query basis and demonstrates superior search performance for specific topics. It trying to solve Abundance Problem by selecting pages based on authority and hub. The model identifies two types of important pages in a hyperlinked environment:

Authority Pages with Authority score

A good authority has links from many good hubs
  • Pages that receive links from many high-quality hub pages
  • Considered authoritative sources for specific topics

Hub Pages with Hub score

A good hub links to many good authorities
  • Pages that link to many high-quality authority pages
  • Act as curated lists or directories of authoritative content
This will turn this into an Circular definition architecture will turn this into an iterative computation
 
 
 

1. Root Set

Given text query, use a text index to get all pages containing query.

2. Base Set

Add in any page that either points to a page in the root set, or is pointed to by a page in the root set.

3. HITS Iteration

 
 
 
 
 

Recommendations