Constructor and Description |
---|
ExternalExcluder(ObjectFilter<CaptureSearchResult> filter) |
Modifier and Type | Class and Description |
---|---|
class |
CustomPolicyOracleFilter
Oracle Filter Implementation that supports custom policies in addition to
allow, block, block-message and robots
The policy is stored in the CaptureSearchResult
Note: this class is being re-designed to allow for run-time customization
(i.e. with Spring config):
Redefine
Policy as an interface + abstract implementation. |
class |
OracleExclusionFilter
ExclusionFilter implementation that queries remote "Exclusion Oracle"
with AccessControlClient to determine which SearchResult s can
be exposed. |
Modifier and Type | Class and Description |
---|---|
class |
RemoteExclusionFilter
Deprecated.
superseded by ExclusionOracle
|
Modifier and Type | Class and Description |
---|---|
class |
HRobotExclusionFilter |
class |
RobotExclusionFilter
CaptureSearchResult Filter that uses a LiveWebCache to retrieve robots.txt
documents from the live web, and filters SearchResults based on the rules
therein.
|
Modifier and Type | Class and Description |
---|---|
class |
RedisRobotExclusionFilter |
Modifier and Type | Class and Description |
---|---|
class |
StaticListExclusionFilter |
class |
StaticMapExclusionFilter |
Modifier and Type | Method and Description |
---|---|
void |
WaybackRequest.addResultFilter(ObjectFilter<CaptureSearchResult> resultFilter) |
Modifier and Type | Class and Description |
---|---|
class |
NotableResultExtractor
ObjectFilter of CaptureSearchResult objects, which includes all results
but keeps track of "notable" results, in particular, those relative to a
specific date: first, previous, closest, next, and last
|
Modifier and Type | Method and Description |
---|---|
ObjectFilter<CaptureSearchResult> |
LocalResourceIndex.getAnnotater() |
ObjectFilter<CaptureSearchResult> |
LocalResourceIndex.getFilter() |
protected ObjectFilter<CaptureSearchResult> |
RemoteResourceIndex.getSearchResultFilters(WaybackRequest wbRequest,
ClosestTrackingCaptureFilterGroup closestGroup) |
Modifier and Type | Method and Description |
---|---|
protected SearchResults |
RemoteResourceIndex.documentToSearchResults(Document document,
ObjectFilter<CaptureSearchResult> filter) |
void |
LocalResourceIndex.setAnnotater(ObjectFilter<CaptureSearchResult> annotater) |
void |
LocalResourceIndex.setFilter(ObjectFilter<CaptureSearchResult> filter) |
protected SearchResults |
RemoteResourceIndex.urlToSearchResults(String requestUrl,
ObjectFilter<CaptureSearchResult> filter) |
Modifier and Type | Method and Description |
---|---|
List<ObjectFilter<CaptureSearchResult>> |
AnnotatingCaptureFilterGroup.getFilters() |
List<ObjectFilter<CaptureSearchResult>> |
ExclusionCaptureFilterGroup.getFilters() |
List<ObjectFilter<CaptureSearchResult>> |
QueryCaptureFilterGroup.getFilters() |
List<ObjectFilter<CaptureSearchResult>> |
CaptureFilterGroup.getFilters() |
List<ObjectFilter<T>> |
WindowFilterGroup.getFilters() |
List<ObjectFilter<CaptureSearchResult>> |
CoreCaptureFilterGroup.getFilters() |
List<ObjectFilter<CaptureSearchResult>> |
ClosestTrackingCaptureFilterGroup.getFilters() |
List<ObjectFilter<CaptureSearchResult>> |
AccessPointCaptureFilterGroup.getFilters() |
Modifier and Type | Class and Description |
---|---|
class |
BeanShellFilter |
class |
ClosestResultTrackingFilter
Class which observes CaptureSearchResults, keeping track of the closest
result found to a given date.
|
class |
CompositeExclusionFilter
SearchResultFilter that abstracts multiple SearchResultFilters -- if all
filters return INCLUDE, then the result is included, but the first to
return ABORT or EXCLUDE short-circuits the rest
|
class |
CompositeFilter
Simple composite ObjectFilter - which includes only if all components include
|
class |
ConditionalGetAnnotationFilter
WARC file allows 2 forms of deduplication.
|
class |
CounterFilter
SearchResultFilter which INCLUDEs all checked records, but keeps track of
how many were seen during processing.
|
class |
DateEmbargoFilter |
class |
DateRangeFilter
SearchResultFilter that excludes records outside of start and end range.
|
class |
DuplicateHashFilter |
class |
DuplicateRecordFilter
ObjectFilter which omits exact duplicate URL+date records from a stream
of CaptureSearchResult.
|
class |
DuplicateTimestampFilter |
class |
EndDateFilter
SearchResultFilter which includes all records until 1 is found beyond end
date then it aborts processing.
|
class |
ExclusionFilter |
class |
FilePrefixDateEmbargoFilter
Blocks only files matching a given prefix, iff they are newer than a given
embargo period.
|
class |
FilePrefixFilter |
class |
FileRegexFilter |
class |
GuardRailFilter
SearchResultFilter which aborts processing when too many records have been
inspected.
|
class |
HostMatchFilter
SearchResultFilter which includes only records that have original host
matching.
|
class |
HttpCodeFilter
ObjectFilter which allows including or excluding results based on the
Http response code.
|
class |
MimeTypeFilter
SearchResultFilter which includes only records matching one or more supplied
Mime-Types.
|
class |
OracleAnnotationFilter
SearchResult filter class which contacts an access-control Oracle, using
information from the public comment field to annotate SearchResult objects.
|
class |
SchemeMatchFilter
ObjectFilter which omits CaptureSearchResult objects if their scheme does not
match the specified scheme.
|
class |
SelfRedirectFilter
SearchResultFilter which INCLUDEs all records, unless they redirect to
themselves, via whatever URL purification schemes are in use.
|
class |
StartDateFilter
SearchResultFilter which includes all records until 1 is found before start
date then it aborts processing.
|
class |
UrlMatchFilter
SearchResultFilter which includes only records that have url matching
aborts as soon as url does not match.
|
class |
UrlPrefixMatchFilter
SearchResultFilter which includes any URL which begins with a given prefix,
and aborts processing when any URL does not match the prefix.
|
class |
UserInfoInAuthorityFilter
Class which omits CaptureSearchResults that have and '@' in the original URL
field, if that '@' is after the scheme, and before the first '/' or ':'
|
class |
WARCRevisitAnnotationFilter
Filter class that observes a stream of SearchResults tracking for each
complete record, a mapping of that records Digest to:
Arc/Warc Filename
Arc/Warc offset
HTTP Response
MIME-Type
Redirect URL
If subsequent SearchResults are missing these fields ("-") and the Digest
field is in the map, then the SearchResults missing fields are replaced with
the values from the previously seen record with the same digest, and an
additional annotation field is added.
|
class |
WindowEndFilter<T>
SearchResultFitler that includes the first N records seen.
|
class |
WindowStartFilter<T>
SearchResultFitler that omits the first N records seen.
|
Modifier and Type | Method and Description |
---|---|
List<ObjectFilter<CaptureSearchResult>> |
CompositeFilter.getFilters() |
Modifier and Type | Method and Description |
---|---|
void |
CompositeFilter.setFilters(List<ObjectFilter<CaptureSearchResult>> filters) |
Modifier and Type | Class and Description |
---|---|
class |
ObjectFilterChain<E>
ObjectFilterChain implements AND logic to chain together multiple
ObjectFilters into a composite.
|
Modifier and Type | Method and Description |
---|---|
ArrayList<ObjectFilter<E>> |
ObjectFilterChain.getFilters() |
Modifier and Type | Method and Description |
---|---|
void |
ObjectFilterChain.addFilter(ObjectFilter<E> filter) |
Modifier and Type | Method and Description |
---|---|
void |
ObjectFilterChain.addFilters(Collection<ObjectFilter<E>> list) |
void |
ObjectFilterChain.setFilters(ArrayList<ObjectFilter<E>> filters) |
Constructor and Description |
---|
ObjectFilterIterator(Iterator<T> itr,
ObjectFilter<T> filter) |
Modifier and Type | Class and Description |
---|---|
class |
FileRegexParamFilterAndFactory |
Modifier and Type | Method and Description |
---|---|
ObjectFilter<CaptureSearchResult> |
FileRegexParamFilterAndFactory.get(AccessPoint ap) |
ObjectFilter<CaptureSearchResult> |
CustomMultiFilterFactory.get(AccessPoint ap) |
ObjectFilter<CaptureSearchResult> |
DuplicateFiltersFactory.get(AccessPoint ap) |
ObjectFilter<CaptureSearchResult> |
PrefixEmbargoResultFilterFactory.get(AccessPoint ap) |
ObjectFilter<CaptureSearchResult> |
CustomResultFilterFactory.get(AccessPoint ap) |
Copyright © 2005–2017 IIPC. All rights reserved.