Hyperlink-Induced Topic Search (HITS) Model
Unlike PageRank, HITS is calculated on a per-query basis and demonstrates superior search performance for specific topics. It trying to solve Abundance Problem by selecting pages based on authority and hub. The model identifies two types of important pages in a hyperlinked environment:
Authority Pages with Authority score
A good authority has links from many good hubs
- Pages that receive links from many high-quality hub pages
- Considered authoritative sources for specific topics
Hub Pages with Hub score
A good hub links to many good authorities
- Pages that link to many high-quality authority pages
- Act as curated lists or directories of authoritative content
This will turn this into an Circular definition architecture will turn this into an iterative computation
1. Root Set
Given text query, use a text index to get all pages containing query.
2. Base Set
Add in any page that either points to a page in the root set, or is pointed to by a page in the root set.
3. HITS Iteration
just as Multi-stage Recommender