public class JSParseFilter extends Object implements ParseFilter, Parser
| Modifier and Type | Field and Description |
|---|---|
static org.slf4j.Logger |
LOG |
X_POINT_IDX_POINT_ID| Constructor and Description |
|---|
JSParseFilter() |
| Modifier and Type | Method and Description |
|---|---|
Parse |
filter(String url,
WebPage page,
Parse parse,
HTMLMetaTags metaTags,
DocumentFragment doc)
Scan the JavaScript looking for possible
Outlink's |
Configuration |
getConf()
Get the
Configuration object |
Collection<WebPage.Field> |
getFields()
Gets all the fields for a given
WebPage Many datastores need to
setup the mapreduce job by specifying the fields needed. |
Parse |
getParse(String url,
WebPage page)
Set the
Configuration object |
static void |
main(String[] args)
Main method which can be run from command line with the plugin option.
|
void |
setConf(Configuration conf)
Set the
Configuration object |
public Parse filter(String url, WebPage page, Parse parse, HTMLMetaTags metaTags, DocumentFragment doc)
Outlink'spublic Parse getParse(String url, WebPage page)
Configuration objectpublic static void main(String[] args) throws Exception
args - Exceptionpublic void setConf(Configuration conf)
Configuration objectsetConf in interface Configurablepublic Configuration getConf()
Configuration objectgetConf in interface Configurablepublic Collection<WebPage.Field> getFields()
WebPage Many datastores need to
setup the mapreduce job by specifying the fields needed. All extensions
that work on WebPage are able to specify what fields they need.getFields in interface FieldPluggableCopyright © 2015 The Apache Software Foundation