org.archive.crawler.extractor
Classes 
AggressiveExtractorHTML
ChangeEvaluator
CrawlUriSWFAction
CustomSWFTags
Extractor
ExtractorCSS
ExtractorDOC
ExtractorHTML
ExtractorHTTP
ExtractorImpliedURI
ExtractorJS
ExtractorPDF
ExtractorSWF
ExtractorTool
ExtractorUniversal
ExtractorURI
ExtractorXML
HTTPContentDigest
JerichoExtractorHTML
Link
PDFParser
TrapSuppressExtractor