------ Apache Any23 - Extractors ------ The Apache Software Foundation ------ 2011-2012 ~~ Licensed to the Apache Software Foundation (ASF) under one or more ~~ contributor license agreements. See the NOTICE file distributed with ~~ this work for additional information regarding copyright ownership. ~~ The ASF licenses this file to You under the Apache License, Version 2.0 ~~ (the "License"); you may not use this file except in compliance with ~~ the License. You may obtain a copy of the License at ~~ ~~ http://www.apache.org/licenses/LICENSE-2.0 ~~ ~~ Unless required by applicable law or agreed to in writing, software ~~ distributed under the License is distributed on an "AS IS" BASIS, ~~ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. ~~ See the License for the specific language governing permissions and ~~ limitations under the License. Apache Any23 Extractors This page enlists all the Apache Any23 Extractors (see source code {{{./xref/org/apache/any23/extractor/package-summary.html}package}}). * Microformat Extractors The following extractors refer to the {{{http://microformats.org/}Microformats specifications}}. Specific details about *Microformats* extractors can be found {{{./dev-microformat-extractors.html}here}}. In particular the *Microformats Nesting* representation policy is described {{{./dev-microformat-extractors.html#microformat-nesting}here}}. {{{./xref/org/apache/any23/extractor/html/AdrExtractor.html}AdrExtractor}} {{{./xref/org/apache/any23/extractor/html/GeoExtractor.html}GeoExtractor}} {{{./xref/org/apache/any23/extractor/html/HCalendarExtractor.html}HCalendar}} {{{./xref/org/apache/any23/extractor/html/HCardExtractor.html}HCard}} {{{./xref/org/apache/any23/extractor/html/HListingExtractor.html}HListing}} {{{./xref/org/apache/any23/extractor/html/HResumeExtractor.html}HResume}} {{{./xref/org/apache/any23/extractor/html/HReviewExtractor.html}HReview}} {{{./xref/org/apache/any23/extractor/html/SpeciesExtractor.html}SpeciesExtractor}} {{{./xref/org/apache/any23/extractor/html/LicenseExtractor.html}LicenseExtractor}} {{{./xref/org/apache/any23/extractor/html/XFNExtractor.html}XFNExtractor}} {{{./xref/org/apache/any23/extractor/html/HRecipeExtractor.html}HRecipeExtractor}} * RDFa [1.0 , 1.1] The following extractors refer to the {{{http://www.w3.org/TR/rdfa-syntax/}RDFa 1.0}} and {{{http://www.w3.org/TR/rdfa-core/}RDFa 1.1}} specifications. {{{./xref/org/apache/any23/extractor/rdfa/RDFaExtractor.html}RDFaExtractor}} * Microdata The following extractors refer to the {{{http://dev.w3.org/html5/md/}Microdata specifications}}. {{{./xref/org/apache/any23/extractor/microdata/MicrodataExtractor.html}MicrodataExtractor}} * RDF {{{./xref/org/apache/any23/extractor/rdf/RDFXMLExtractor.html}RDFXMLExtractor}} {{{./xref/org/apache/any23/extractor/rdf/NQuadsExtractor.html}NQuadsExtractor}} {{{./xref/org/apache/any23/extractor/rdf/TurtleExtractor.html}TurtleExtractor}} {{{./xref/org/apache/any23/extractor/rdf/NTriplesExtractor.html}NTriplesExtractor}} * Metadata Extractors {{{./xref/org/apache/any23/extractor/html/TitleExtractor.html}TitleExtractor}} {{{./xref/org/apache/any23/extractor/html/HTMLMetaExtractor.html}HTMLMetaExtractor}} {{{./xref/org/apache/any23/extractor/html/HeadLinkExtractor.html}HeadLinkExtractor}} {{{./xref/org/apache/any23/extractor/html/ICBMExtractor.html}ICBMExtractor}} {{{./xref/org/apache/any23/extractor/html/TurtleHTMLExtractor.html}TurtleHTMLExtractor}} * Content Extractors {{{./xref/org/apache/any23/extractor/xpath/XPathExtractor.html}XPath Extractor}} (<>) {{{./xref/org/apache/any23/extractor/csv/CSVExtractor.html}CSV Extractor}} (See the extraction {{{./dev-csv-extractor.html}algorithm}}.) Get more documentation It is possible to generate the list of all the available extractors invoking the following command: +------------------------------------------------------------ /bin$ any23tools ExtractorDocumentation -list +------------------------------------------------------------