|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||
| Packages that use org.archive.crawler.frontier | |
|---|---|
| org.archive.crawler.extractor | |
| org.archive.crawler.filter | |
| org.archive.crawler.framework | |
| org.archive.crawler.frontier | |
| org.archive.crawler.postprocessor | |
| Classes in org.archive.crawler.frontier used by org.archive.crawler.extractor | |
|---|---|
| AdaptiveRevisitAttributeConstants
Defines static constants for the Adaptive Revisiting module defining data keys in the CrawlURI AList. |
|
| Classes in org.archive.crawler.frontier used by org.archive.crawler.filter | |
|---|---|
| AdaptiveRevisitAttributeConstants
Defines static constants for the Adaptive Revisiting module defining data keys in the CrawlURI AList. |
|
| Classes in org.archive.crawler.frontier used by org.archive.crawler.framework | |
|---|---|
| FrontierJournal
Record of key Frontier happenings. |
|
| Classes in org.archive.crawler.frontier used by org.archive.crawler.frontier | |
|---|---|
| AbstractFrontier
Shared facilities for Frontier implementations. |
|
| AdaptiveRevisitAttributeConstants
Defines static constants for the Adaptive Revisiting module defining data keys in the CrawlURI AList. |
|
| AdaptiveRevisitHostQueue
A priority based queue of CrawlURIs. |
|
| AdaptiveRevisitQueueList
Maintains an ordered list of AdaptiveRevisitHostQueues used by a
Frontier. |
|
| BdbFrontier
A Frontier using several BerkeleyDB JE Databases to hold its record of known hosts (queues), and pending URIs. |
|
| BdbMultipleWorkQueues
A BerkeleyDB-database-backed structure for holding ordered groupings of CrawlURIs. |
|
| CostAssignmentPolicy
Calculate a integer 'cost' value for the given CrawlURI. |
|
| FrontierJournal
Record of key Frontier happenings. |
|
| HostnameQueueAssignmentPolicy
QueueAssignmentPolicy based on the hostname:port evident in the given CrawlURI. |
|
| QueueAssignmentPolicy
Establishes a mapping from CrawlURIs to String keys (queue names). |
|
| SurtAuthorityQueueAssignmentPolicy
SurtAuthorityQueueAssignmentPolicy based on the surt form of hostname. |
|
| UnitCostAssignmentPolicy
A CostAssignment policy that uses a constant value of 1 for all CrawlURIs. |
|
| WorkQueue
A single queue of related URIs to visit, grouped by a classKey (typically "hostname:port" or similar) |
|
| WorkQueueFrontier
A common Frontier base using several queues to hold pending URIs. |
|
| WorkQueueFrontier.WakeTask
|
|
| Classes in org.archive.crawler.frontier used by org.archive.crawler.postprocessor | |
|---|---|
| AdaptiveRevisitAttributeConstants
Defines static constants for the Adaptive Revisiting module defining data keys in the CrawlURI AList. |
|
|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||