All Classes Interface Summary Class Summary Enum Summary Exception Summary Annotation Types Summary
| Class |
Description |
| AboutNotURIRule |
This rule is able to detect whether an about value is a valid URL or otherwise is a valid relative URL.
|
| AcceptHeaderBuilder |
Concatenates a collection of MIME specs in "type/subtype;q=x.x" notation into an HTTP Accept header value, and
removes duplicates and types covered by wildcards.
|
| AdrExtractor |
Extractor for the adr microformat.
|
| AdrExtractorFactory |
|
| Any23 |
A facade with convenience methods for typical Any23 extraction operations.
|
| Any23ValueFactoryWrapper |
Any23 specialization of the ValueFactory.
|
| BaseRDFExtractor |
Base class for a generic RDF Extractor.ContentExtractor.
|
| BenchmarkTripleHandler |
TripleHandler decorator useful to perform benchmarking.
|
| ByteArrayDocumentSource |
ByteArray implementation of DocumentSource.
|
| CompositeTripleHandler |
A TripleHandler multi decorator, that wraps zero or more other triple handlers and dispatches all events to
each of them.
|
| CountingTripleHandler |
A simple TripleHandler that merely counts the number of triples it has received.
|
| CSVExtractor |
This extractor produces RDF from a CSV file .
|
| CSVExtractorFactory |
|
| DefaultDOMDocument |
This class wraps the DOM document.
|
| DefaultHTTPClient |
|
| DefaultHTTPClientConfiguration |
|
| DefaultRuleContext |
|
| DefaultValidationReport |
|
| DefaultValidationReportBuilder |
|
| DefaultValidator |
|
| DiscoveryUtils |
This class provides utility methods for discovering classes in packages.
|
| DocumentReport |
Represents the validationReportBuilder generated by a the TagSoupParser when a document is retrieved and
validated.
|
| DOMDocument |
This interface models a document to be processed by the Validator.
|
| DomUtils |
This class provides utility methods for DOM manipulation.
|
| ElementsProcessor |
|
| ElementsProcessor.ModelHolder |
A model holder describes the two required parameters which makes a model useful in further processing: a root
node and model itself.
|
| EmbeddedJSONLDExtractor |
This extractor represents the HTML script tags used to embed blocks of data in documents.
|
| EmbeddedJSONLDExtractorFactory |
|
| EmptyValidationReport |
|
| EntityBasedMicroformatExtractor |
Base class for microformat extractors based on entities.
|
| ExampleInputOutput |
A reporter for example input and output of an extractor.
|
| ExtractionContextBlocker |
A wrapper around a TripleHandler that can block and unblock calls to the handler, either for the entire
document, or for individual ExtractionContexts.
|
| ExtractionReport |
This class contains some statistics and general information about an extraction.
|
| ExtractionResultImpl |
A default implementation of ExtractionResult; it receives extraction output from one Extractor
working on one document, and passes the output on to a TripleHandler.
|
| ExtractorRegistryImpl |
Singleton class acting as a register for all the various Extractor.
|
| FileDocumentSource |
File implementation of DocumentSource.
|
| FileUtils |
Utility class for handling files.
|
| Fix |
This interface models a fix for an issue.
|
| FunctionalSyntaxExtractor |
|
| FunctionalSyntaxExtractorFactory |
|
| GeoExtractor |
Extractor for the Geo microformat.
|
| GeoExtractorFactory |
|
| HAdrExtractor |
Extractor for the h-adr microformat.
|
| HAdrExtractorFactory |
|
| HCalendarExtractor |
|
| HCalendarExtractorFactory |
|
| HCardExtractor |
Extractor for the hCard microformat.
|
| HCardExtractor |
Extractor for the h-Card microformat.
|
| HCardExtractorFactory |
|
| HCardExtractorFactory |
|
| HCardName |
An HCard name, consisting of various parts.
|
| HeadLinkExtractor |
This Extractor.TagSoupDOMExtractor implementation retrieves the LINKs
declared within the HTML/HEAD page header.
|
| HeadLinkExtractorFactory |
|
| HEntryExtractor |
Extractor for the h-entry microformat.
|
| HEntryExtractorFactory |
Extractor for the h-entry microformat.
|
| HEventExtractor |
Extractor for the h-event microformat.
|
| HEventExtractorFactory |
|
| HGeoExtractor |
Extractor for the h-geo microformat.
|
| HGeoExtractorFactory |
|
| HItemExtractor |
Extractor for the h-item microformat.
|
| HItemExtractorFactory |
|
| HListingExtractor |
|
| HListingExtractorFactory |
|
| HProductExtractor |
|
| HProductExtractorFactory |
|
| HRecipeExtractor |
Extractor for the hRecipe microformat.
|
| HRecipeExtractor |
Extractor for the hRecipe microformat.
|
| HRecipeExtractorFactory |
|
| HRecipeExtractorFactory |
|
| HResumeExtractor |
Extractor for the hResume microformat.
|
| HResumeExtractor |
Extractor for the hResume microformat.
|
| HResumeExtractorFactory |
|
| HResumeExtractorFactory |
|
| HReviewAggregateExtractor |
|
| HReviewAggregateExtractorFactory |
|
| HReviewExtractor |
Extractor for the hReview microformat.
|
| HReviewExtractorFactory |
|
| HTMLDocument |
A wrapper around the DOM representation of an HTML document.
|
| HTMLDocument.TextField |
This class represents a text extracted from the HTML DOM related to the node from which such test has been
retrieved.
|
| HTMLMetaExtractor |
|
| HTMLMetaExtractorFactory |
|
| HTTPClient |
|
| HTTPClientConfiguration |
|
| HTTPDocumentSource |
Http implementation of DocumentSource.
|
| ICalExtractor |
|
| ICalExtractorFactory |
|
| ICBMExtractor |
Extractor for "ICBM coordinates" provided as META headers in the head of an HTML page.
|
| ICBMExtractorFactory |
|
| IgnoreAccidentalRDFa |
A TripleHandler that suppresses output of the RDFa parser if the document only contains "accidental" RDFa,
like stylesheet links and other non-RDFa uses of HTML's
|
| IgnoreTitlesOfEmptyDocuments |
A TripleHandler that suppresses output of the TitleExtractor unless some other triples could be
parsed from the document.
|
| Includes |
This annotation could be used to decorate a MicroformatExtractor to represent which of the other
Microformats could it nest.
|
| Includes |
This annotation could be used to decorate a MicroformatExtractor to represent which of the other
Microformats could it nest.
|
| Item |
Base class for Microdata items.
|
| ItemProp |
Describes a Microdata item property.
|
| ItemPropValue |
Describes a possible value for a Microdata item property.
|
| ItemPropValue.Type |
Supported types.
|
| ItemScope |
This class describes a Microdata itemscope.
|
| JCalExtractor |
|
| JCalExtractorFactory |
|
| JSONLDExtractor |
Concrete implementation of Extractor.ContentExtractor handling
JSON-LD format.
|
| JSONLDExtractorFactory |
|
| JSONLDWriter |
Implementation of JSON-LD TripleWriter.
|
| JSONLDWriterFactory |
|
| JSONWriter |
Deprecated.
|
| JSONWriterFactory |
Deprecated.
|
| JsoupUtils |
|
| LicenseExtractor |
|
| LicenseExtractorFactory |
|
| LocalCopyFactory |
A factory that creates local copies of DocumentSources.
|
| LoggingTripleHandler |
Triple handler decorator useful for logging purposes.
|
| LogUtils |
Deprecated.
|
| ManchesterSyntaxExtractor |
Concrete implementation of Extractor.ContentExtractor handling
Manchester Syntax format.
|
| ManchesterSyntaxExtractorFactory |
|
| MathUtils |
Mathematical utility functions.
|
| MemCopyFactory |
Creates local copies of DocumentSource by reading them into an in-memory buffer.
|
| MetaNameMisuseFix |
Fixes the misuse of the meta name attribute.
|
| MetaNameMisuseRule |
Checks whether the meta attribute name is used to contain a property.
|
| MicrodataExtractor |
Default implementation of Microdata extractor, based on
Extractor.TagSoupDOMExtractor.
|
| MicrodataExtractorFactory |
|
| MicrodataParser |
This class provides utility methods for handling Microdata nodes contained within a DOM document.
|
| MicrodataParserException |
Defines an exception occurring while parsing Microdata.
|
| MicrodataParserReport |
|
| MicroformatExtractor |
|
| Microformats2Prefixes |
|
| MissingItemscopeAttributeValueFix |
|
| MissingItemscopeAttributeValueRule |
This fixes missing attribute values for the 'itemscope' attribute Typically when such a snippet of XHTML is fed
through the RDFa11Extractor, and subsequently to Sesame's SesameRDFaParser,
it will result in the following behavior.
|
| MissingOpenGraphNamespaceRule |
This rule detects the issue of missing Open Graph namespace.
|
| NQuadsExtractor |
Concrete implementation of Extractor.ContentExtractor handling
N-Quads format.
|
| NQuadsExtractorFactory |
|
| NQuadsWriter |
Implementation of an N-Quads TripleWriter.
|
| NQuadsWriterFactory |
|
| NTriplesExtractor |
Concrete implementation of Extractor.ContentExtractor handling NTriples
NTriples format.
|
| NTriplesExtractorFactory |
|
| NTriplesWriter |
Implementation of an N-Triples TripleWriter.
|
| NTriplesWriterFactory |
|
| OpenGraphNamespaceFix |
This fixes the missing Open Graph protocol.
|
| PopularPrefixes |
This class act as a container for various well-known and adopted RDF Vocabulary prefixes.
|
| QuadTemplate |
This class models a NQuads template, that is a quadruple in which any component can be a variable.
|
| RDFa11Extractor |
Extractor implementation for RDFa
1.1 specification.
|
| RDFa11ExtractorFactory |
|
| RDFa11Parser |
Deprecated.
|
| RDFa11ParserException |
Deprecated.
|
| RDFaExtractor |
Extractor implementation for RDFa
1.0 specification.
|
| RDFaExtractorFactory |
|
| RDFHandlerAdapter |
An RDFHandler that relays statements and prefix definitions to an ExtractionResult.
|
| RDFParserFactory |
This factory provides a common logic for creating and configuring correctly any RDF parser used within the
library.
|
| RDFSchemaUtils |
This class provides a set of methods for generating RDF Schema.
|
| RDFUtils |
Basic class providing a set of utility methods when dealing with RDF.
|
| RDFWriterTripleHandler |
A TripleHandler that writes triples to a Sesame RDFWriter, eg for serialization
using one of Sesame's writers.
|
| RDFXMLExtractor |
Concrete implementation of Extractor.ContentExtractor able to perform the
extraction on RDF/XML documents.
|
| RDFXMLExtractorFactory |
|
| RDFXMLWriter |
RDF/XML TripleWriter implementation.
|
| RDFXMLWriterFactory |
|
| ReaderInputStream |
Adapts a Reader as an InputStream.
|
| ReportingTripleHandler |
A TripleHandler that collects various information about the extraction process, such as the extractors used
and the total number of triples.
|
| RepositoryWriter |
An RDF4J repository triple writer.
|
| Rule |
Models a rule for an issue detection.
|
| RuleContext<T> |
This context is used to pass data from a Rule to a Fix.
|
| SerializationException |
|
| SimpleExtractorFactory<T extends org.apache.any23.extractor.Extractor<?>> |
This class is a simple and default-like implementation of ExtractorFactory.
|
| SingleDocumentExtraction |
This class acts as a facade where all extractors (for a given MIMEType) can be called on a single document.
|
| SingleDocumentExtractionReport |
|
| SpanCloserInputStream |
Extension of InputStream meant to detect and replace any occurrence of inline span:
|
| SpeciesExtractor |
|
| SpeciesExtractorFactory |
|
| StreamUtils |
Contains general utility functions for handling streams.
|
| StringDocumentSource |
String implementation of DocumentSource.
|
| StringUtils |
This class provides a set of string utility methods.
|
| TagSoupExtractionResult |
This interface models a specific ExtractionResult able to collect property roots generated by HTML
Microformat extractions.
|
| TagSoupExtractionResult.PropertyPath |
Defines a property path object.
|
| TagSoupExtractionResult.ResourceRoot |
Defines a property root object.
|
| TagSoupParser |
|
| TagSoupParser.ElementLocation |
Describes a DOM Element location.
|
| TemplateGraph |
Represents an Quad graph IRI template.
|
| TemplateObject |
Represents a Quad object template.
|
| TemplateObject.Type |
Supported object types.
|
| TemplatePredicate |
Represents a Quad predicate template.
|
| TemplateSubject |
Represents a Quad subject template.
|
| TemplateSubject.Type |
Supported subject types.
|
| TemplateXPathExtractionRule |
|
| TemplateXPathExtractionRuleImpl |
|
| Term<T extends org.eclipse.rdf4j.model.Value> |
Represents a generic template term.
|
| TitleExtractor |
Extracts the value of the <title> element of an HTML or XHTML page.
|
| TitleExtractorFactory |
|
| TripleWriterHandler |
This class connects a TripleHandler to a TripleWriter by writing received data.
|
| TriXExtractor |
Concrete implementation of Extractor.ContentExtractor to perform extraction on
TriX documents.
|
| TriXExtractorFactory |
|
| TriXWriter |
TriX TripleWriter implementation.
|
| TriXWriterFactory |
|
| TurtleExtractor |
Concrete implementation of Extractor.ContentExtractor able to perform the
extraction on Turtle documents.
|
| TurtleExtractorFactory |
|
| TurtleHTMLExtractor |
Extractor for Turtle/N3 format embedded within HTML script tags.
|
| TurtleHTMLExtractorFactory |
|
| TurtleWriter |
N3 notation TripleWriter implementation.
|
| TurtleWriterFactory |
|
| URIListWriter |
This writer simply produces a list of unique IRI present in the subject or in the object of every single
extracted RDF Statement.
|
| URIListWriterFactory |
|
| URLUtils |
Contains general utility functions for handling URLs.
|
| ValidationReport |
This class contains the report of a validation performed by the Validator class.
|
| ValidationReport.Error |
An error occurred while performing the validation process.
|
| ValidationReport.FixError |
An error occurred while executing a fix.
|
| ValidationReport.Issue |
An issue found during the validation process.
|
| ValidationReport.IssueLevel |
Defines the different issue levels.
|
| ValidationReport.RuleActivation |
This class describes the activation of a rule.
|
| ValidationReport.RuleError |
An error occurred while executing a rule.
|
| ValidationReportBuilder |
The report interface is used to generate diagnostics about validation.
|
| ValidationReportSerializer |
Defines a serializer for validation reports.
|
| Validator |
The validator class allows to perform validation - correction of related to HTML Document
instances.
|
| ValidatorException |
Models any exception raised by the Validator.
|
| Variable |
|
| WriterSettings |
This class encapsulates commonly supported settings for TripleWriter implementations.
|
| XCalExtractor |
|
| XCalExtractorFactory |
|
| XFNExtractor |
Extractor for the XFN microformat.
|
| XFNExtractorFactory |
|
| XMLValidationReportSerializer |
|
| XMLValidationReportSerializer.NodeName |
Allows to specify a custom node name.
|
| XPathExtractionRule |
|
| XPathExtractor |
Implementation of an Extractor.TagSoupDOMExtractor able to apply
XPathExtractionRules and generate quads.
|
| XPathExtractorFactory |
|
| XSLTStylesheet |
An XSLT stylesheet loaded from an InputStream, can be applied to DOM trees and writes the result to a Writer.
|
| XSLTStylesheetException |
|
| YAMLExtractor |
|
| YAMLExtractorFactory |
|
| YAMLProcessorException |
|