Main Page   Class Hierarchy   Compound List   File List   Compound Members   File Members  

GraphParseHandler Class Reference

Reads anchor links and other document information for use by GraphBuilder. More...

Inheritance diagram for GraphParseHandler:

Inheritance graph
[legend]
Collaboration diagram for GraphParseHandler:

Collaboration graph
[legend]
List of all members.

Public Methods

 GraphParseHandler (GraphBuilder *graphbuilder)
virtual void NewDocument (const Document *doc)
virtual void AddHeader (const char *key, int keylen, const char *value, int valuelen)
 Reads the date and calls gb->NodeSetDate(). More...

virtual void AddAnchor (const char *href, int hreflen)

Private Attributes

GraphBuildergb

Detailed Description

Reads anchor links and other document information for use by GraphBuilder.

GraphParseHandler is called for each document being processed, and interfaces with GraphBuilder, telling it the current document's URL and anchor links, as well as the document's date.

Not all anchor links are passed on to GraphBuilder, however. GraphParseHandler doesn't pass on autolinks, or links to non-html documents.

Definition at line 38 of file parsehandler-graph.cc.


Constructor & Destructor Documentation

GraphParseHandler::GraphParseHandler GraphBuilder   graphbuilder
 

Definition at line 56 of file parsehandler-graph.cc.

References gb.


Member Function Documentation

void GraphParseHandler::AddAnchor const char *    href,
int    hreflen
[virtual]
 

Reimplemented from ParseHandler.

Definition at line 178 of file parsehandler-graph.cc.

References CONTENT_APPLICATION_MS_POWERPOINT, CONTENT_APPLICATION_MSWORD, CONTENT_APPLICATION_PDF, CONTENT_APPLICATION_POSTSCRIPT, CONTENT_APPLICATION_XGZIP, CONTENT_AUDIO_MP3, CONTENT_GOOGLE_OTHER, CONTENT_IMAGE, CONTENT_TEXT_HTML, CONTENT_TEXT_PLAIN, CONTENT_TEXT_RTF, ContentType, GraphBuilder::flags, GraphBuilder::FormatURL(), gb, GraphBuilder::NodeGetURL(), and GraphBuilder::TrieInsertLinkURL().

void GraphParseHandler::AddHeader const char *    key,
int    keylen,
const char *    value,
int    valuelen
[virtual]
 

Reads the date and calls gb->NodeSetDate().

Reimplemented from ParseHandler.

Definition at line 74 of file parsehandler-graph.cc.

References gb, int64, kuint16max, GraphBuilder::NodeSetDate(), NULL, and uint16.

void GraphParseHandler::NewDocument const Document   doc [virtual]
 

Reimplemented from ParseHandler.

Definition at line 61 of file parsehandler-graph.cc.

References gb, GraphBuilder::NodeSetURL(), NULL, Document::url(), and Document::url_after_redirects().


Member Data Documentation

GraphBuilder* GraphParseHandler::gb [private]
 

Definition at line 49 of file parsehandler-graph.cc.

Referenced by AddAnchor(), AddHeader(), GraphParseHandler(), and NewDocument().


Generated on Wed May 29 11:37:25 2002 for MarkovPR by doxygen1.2.15