Class and Description |
---|
AbortChecker
This class furnishes an abort signal whenever the job activity says it should.
|
AuthenticationCredentials
This interface describes immutable classes that represent authentication information for all kinds of authentication.
|
CookieManager
This class manages the database table into which we write cookies.
|
CookieManager.CookiesCacheClass
Cache class for cookies.
|
CookieManager.CookiesDescription
This is the object description for a session key object.
|
CredentialsDescription
This class describes credential information pulled from a configuration.
|
CredentialsDescription.SessionCredentialParameter
Session credential parameter class
|
DataCache
This class is a cache of a specific URL's data.
|
DataCache.DocumentData
This class represents everything we need to know about a document that's getting passed from the getDocumentVersions() phase to the processDocuments() phase.
|
DNSManager
This class manages the database table into which we write DNS entries for hosts.
|
DNSManager.DNSCacheClass
Cache class for DNS entries.
|
DNSManager.DNSInfo
This is a cached data item.
|
DNSManager.HostDescription
This is the object description for a robots host object.
|
FindHandler
This class is used to discover links in a session login context.
|
FormData
This interface describes the form data gleaned from an HTML page.
|
FormDataAccumulator
This class accumulates form data and allows overrides.
|
FormDataElement
This interface describes individual form data elements, for form submission.
|
IDiscoveredLinkHandler
This interface describes the functionality needed by a link extractor to note a discovered link.
|
IHTMLHandler
This interface describes the functionality needed by an HTML processor in order to handle an HTML document.
|
IMetaTagHandler
This interface describes the functionality needed by a parser to handle metadata tags.
|
IRedirectionHandler
This interface describes the functionality needed by a redirection processor in order to handle a redirection.
|
IThrottledConnection
This interface represents an established connection to a URL.
|
IXMLHandler
This interface describes the functionality needed by an XML processor in order to handle an XML document.
|
LinkParseState
This class recognizes and interprets all links.
|
LoginCookies
This interface describes cookies obtained during sequential authentication.
|
LoginParameters
This interface describes login parameters to be used to submit a page during sequential authentication.
|
MetaParseState
This class recognizes and interprets all meta tags.
|
PageCredentials
This interface describes immutable classes that represent authentication information for page-based authentication.
|
RobotsManager
This class manages the database table into which we write robots.txt files for hosts.
|
RobotsManager.HostDescription
This is the object description for a robots host object.
|
RobotsManager.RobotsCacheClass
Cache class for robots.
|
RobotsManager.RobotsData
This is a cached data item.
|
ScriptParseState
This class interprets the tag stream generated by the HTMLParseState class, and causes script sections to be skipped.
|
SequenceCredentials
This interface describes immutable classes that represent authentication information for sequence-based authentication.
|
ThrottleDescription
This class describes complex throttling criteria pulled from a configuration.
|
ThrottleDescription.ThrottleItem
Class representing an individual throttle item.
|
ThrottledFetcher.ConnectionPool
Each connection pool has identical connections we can draw on.
|
ThrottledFetcher.ConnectionPoolKey
Connection pool key
|
ThrottledFetcher.ExecuteMethodThread
This thread does the actual socket communication with the server.
|
ThrottledFetcher.ThrottledConnection
Throttled connections.
|
TrustsDescription
This class describes trust information pulled from a configuration.
|
WebcrawlerConnector.CanonicalizationPolicies
Class representing a list of canonicalization rules
|
WebcrawlerConnector.CanonicalizationPolicy
Class representing a URL regular expression match, for the purposes of determining canonicalization policy
|
WebcrawlerConnector.DocumentURLFilter
This class describes the URL filtering information (for crawling and indexing) obtained from a digested DocumentSpecification.
|
WebcrawlerConnector.EvaluatorToken
Evaluator token.
|
WebcrawlerConnector.FetchStatus |
WebcrawlerConnector.MappingRule
Class representing a mapping rule
|
WebcrawlerConnector.MappingRules
Class that represents all mappings
|
WebcrawlerConnector.NameValue
Name/value class
|
WebcrawlerConnector.ProcessActivityLinkHandler
This class is the handler for links that get added into an IProcessActivity object.
|
WebURL
Replacement class for java.net.URI, which is broken in many ways.
|
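
The WebURL entry above describes java.net.URI as broken in many ways without listing them. As a hedged illustration only, the sketch below shows two well-known behaviors of java.net.URI that tend to cause trouble for a web crawler. It does not use or describe WebURL itself; the class name `UriQuirks`, its helper methods, and the example URLs are hypothetical, and whether these particular quirks motivated WebURL is an assumption.

```java
import java.net.URI;
import java.net.URISyntaxException;

// Hypothetical demo class; not part of the connector's API.
public class UriQuirks {

    // Returns the host that java.net.URI reports, or null if it cannot identify one.
    static String hostOf(String url) {
        return URI.create(url).getHost();
    }

    // Returns true if java.net.URI accepts the string without throwing.
    static boolean parses(String url) {
        try {
            new URI(url);
            return true;
        } catch (URISyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // An ordinary URL: the host is reported as expected.
        System.out.println(hostOf("http://example.com/index.html"));  // example.com

        // An underscore in the host name is accepted, but the authority is
        // treated as registry-based, so no host is reported.
        System.out.println(hostOf("http://my_host.example.com/"));    // null

        // An unescaped space, common in links found on real pages, is rejected.
        System.out.println(parses("http://example.com/a b.html"));    // false
    }
}
```

A crawler that takes URLs verbatim from fetched pages meets both cases regularly, which is one plausible reason to wrap or replace the standard class rather than rely on it directly.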