[Figure: inheritance diagram for GraphParseHandler]
Public Methods
 | GraphParseHandler (GraphBuilder *graphbuilder)
virtual void | NewDocument (const Document *doc)
virtual void | AddHeader (const char *key, int keylen, const char *value, int valuelen)
 | Reads the date and calls gb->NodeSetDate().
virtual void | AddAnchor (const char *href, int hreflen)
Private Attributes
GraphBuilder * | gb
GraphParseHandler is called for each document being processed. It interfaces with GraphBuilder, telling it the current document's URL, its anchor links, and its date.
Not all anchor links are passed on to GraphBuilder, however: GraphParseHandler does not pass on autolinks or links to non-HTML documents.
Definition at line 38 of file parsehandler-graph.cc.
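The filtering described above can be illustrated with a minimal, self-contained sketch. It is not the code in parsehandler-graph.cc. SketchBuilder, SketchHandler, and LooksLikeHtml are hypothetical names introduced here, and the sketch rests on two assumptions: that an "autolink" is a link from a document to its own URL, and that a non-HTML target can be recognized by its file extension (the real class works with ContentType values instead).

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <utility>
#include <vector>

// Hypothetical stand-in for GraphBuilder: records what the handler reports.
struct SketchBuilder {
  std::string current_url;
  std::vector<std::pair<std::string, std::string>> links;  // (from, to)

  void SetURL(const std::string& url) { current_url = url; }
  void InsertLink(const std::string& to) { links.emplace_back(current_url, to); }
};

// Assumption: a target is treated as HTML unless it ends in one of these
// extensions. The real class consults ContentType values, not extensions.
inline bool LooksLikeHtml(const std::string& url) {
  static const char* const kNonHtml[] = {".pdf", ".ps",  ".doc", ".ppt", ".gz",
                                         ".mp3", ".png", ".jpg", ".gif"};
  for (const char* ext : kNonHtml) {
    const std::string suffix(ext);
    if (url.size() >= suffix.size() &&
        url.compare(url.size() - suffix.size(), suffix.size(), suffix) == 0) {
      return false;
    }
  }
  return true;
}

// Hypothetical handler with the same call shape as the documented class.
class SketchHandler {
 public:
  explicit SketchHandler(SketchBuilder* builder) : gb_(builder) {
    assert(gb_ != nullptr);
  }

  // Called once per document: tell the builder which node is current.
  void NewDocument(const std::string& url) { gb_->SetURL(url); }

  // Called once per anchor: href is length-delimited, not NUL-terminated.
  void AddAnchor(const char* href, int hreflen) {
    if (href == nullptr || hreflen <= 0) return;
    const std::string target(href, static_cast<std::size_t>(hreflen));
    if (target == gb_->current_url) return;  // assumed meaning of "autolink"
    if (!LooksLikeHtml(target)) return;      // skip non-HTML targets
    gb_->InsertLink(target);
  }

 private:
  SketchBuilder* gb_;
};
```

In this sketch, a handler that has received NewDocument for a page and is then given three anchors (the page itself, a .pdf file, and another HTML page) records only the last one as a link.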
GraphParseHandler::GraphParseHandler (GraphBuilder *graphbuilder)
Definition at line 56 of file parsehandler-graph.cc. References gb.
virtual void GraphParseHandler::AddAnchor (const char *href, int hreflen)
Reimplemented from ParseHandler. Definition at line 178 of file parsehandler-graph.cc. References CONTENT_APPLICATION_MS_POWERPOINT, CONTENT_APPLICATION_MSWORD, CONTENT_APPLICATION_PDF, CONTENT_APPLICATION_POSTSCRIPT, CONTENT_APPLICATION_XGZIP, CONTENT_AUDIO_MP3, CONTENT_GOOGLE_OTHER, CONTENT_IMAGE, CONTENT_TEXT_HTML, CONTENT_TEXT_PLAIN, CONTENT_TEXT_RTF, ContentType, GraphBuilder::flags, GraphBuilder::FormatURL(), gb, GraphBuilder::NodeGetURL(), and GraphBuilder::TrieInsertLinkURL().
virtual void GraphParseHandler::AddHeader (const char *key, int keylen, const char *value, int valuelen)
Reads the date and calls gb->NodeSetDate().
Reimplemented from ParseHandler. Definition at line 74 of file parsehandler-graph.cc. References gb, int64, kuint16max, GraphBuilder::NodeSetDate(), NULL, and uint16.
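The references to int64, uint16, and kuint16max suggest that the date is narrowed to a 16-bit value before it is stored, though this page does not say how. The sketch below shows one way a handler with this signature could do that. Everything in it is an assumption made for illustration: the header key "date", a value given as decimal seconds since the epoch, the conversion to whole days, and the names DateSink, KeyIs, ClampToUint16, and AddHeaderSketch.

```cpp
#include <cctype>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <limits>

// Hypothetical stand-in for the part of GraphBuilder that stores a date.
struct DateSink {
  bool has_date = false;
  std::uint16_t date = 0;
  void SetDate(std::uint16_t d) {
    has_date = true;
    date = d;
  }
};

// Case-insensitive comparison of a length-delimited key with a C string.
inline bool KeyIs(const char* key, int keylen, const char* want) {
  const std::size_t n = std::strlen(want);
  if (key == nullptr || keylen < 0 || static_cast<std::size_t>(keylen) != n) {
    return false;
  }
  for (std::size_t i = 0; i < n; ++i) {
    if (std::tolower(static_cast<unsigned char>(key[i])) !=
        std::tolower(static_cast<unsigned char>(want[i]))) {
      return false;
    }
  }
  return true;
}

// Saturate a 64-bit value into the uint16 range instead of wrapping.
inline std::uint16_t ClampToUint16(std::int64_t v) {
  const std::int64_t max = std::numeric_limits<std::uint16_t>::max();
  if (v < 0) return 0;
  if (v > max) return static_cast<std::uint16_t>(max);
  return static_cast<std::uint16_t>(v);
}

// Assumed behavior: ignore every header except "date", parse its value as
// decimal seconds since the epoch, and store the result as whole days.
inline void AddHeaderSketch(DateSink* sink, const char* key, int keylen,
                            const char* value, int valuelen) {
  if (sink == nullptr || value == nullptr) return;
  if (!KeyIs(key, keylen, "date")) return;
  if (valuelen <= 0 || valuelen > 18) return;  // 18 digits cannot overflow int64
  std::int64_t seconds = 0;
  for (int i = 0; i < valuelen; ++i) {
    if (value[i] < '0' || value[i] > '9') return;  // not a plain number
    seconds = seconds * 10 + (value[i] - '0');
  }
  sink->SetDate(ClampToUint16(seconds / 86400));
}
```

Saturating rather than truncating keeps an out-of-range date from wrapping around to a small, plausible-looking value.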
virtual void GraphParseHandler::NewDocument (const Document *doc)
Reimplemented from ParseHandler. Definition at line 61 of file parsehandler-graph.cc. References gb, GraphBuilder::NodeSetURL(), NULL, Document::url(), and Document::url_after_redirects().
GraphBuilder * GraphParseHandler::gb
Definition at line 49 of file parsehandler-graph.cc. Referenced by AddAnchor(), AddHeader(), GraphParseHandler(), and NewDocument().