|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.archive.crawler.frontier.CostAssignmentPolicy
org.archive.crawler.frontier.UnitCostAssignmentPolicy
org.archive.crawler.frontier.AntiCalendarCostAssignmentPolicy
public class AntiCalendarCostAssignmentPolicy
CostAssignmentPolicy that further penalizes URIs with calendar-suggestive strings in them, with an extra unit of cost. Will catch some 'innocent' URIs, but only when uncaught large-volume chaff is ranked higher than caught 'wheat' will this cause notable problems.
| Field Summary | |
|---|---|
static java.lang.String |
CALENDARISH
|
| Constructor Summary | |
|---|---|
AntiCalendarCostAssignmentPolicy()
|
|
| Method Summary | |
|---|---|
int |
costOf(CrawlURI curi)
|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Field Detail |
|---|
public static java.lang.String CALENDARISH
| Constructor Detail |
|---|
public AntiCalendarCostAssignmentPolicy()
| Method Detail |
|---|
public int costOf(CrawlURI curi)
costOf in class UnitCostAssignmentPolicy
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||