com.bigdata.rdf.store
Interface DataLoader.Options

All Superinterfaces:
RDFParserOptions.Options
All Known Subinterfaces:
AbstractTripleStore.Options, BigdataSail.Options, LocalTripleStore.Options, TempTripleStore.Options
Enclosing class:
DataLoader

public static interface DataLoader.Options
extends RDFParserOptions.Options

Options for the DataLoader. Note: The default for RDFParserOptions.Options.PRESERVE_BNODE_IDS is conditionally overridden when LexiconRelation.isStoreBlankNodes() is true.

Version:
$Id: DataLoader.java 6045 2012-02-27 17:33:44Z thompsonbry $
Author:
Bryan Thompson

Field Summary
static String BUFFER_CAPACITY
          Optional property specifying the capacity of the StatementBuffer (default is "100000" statements).
static String CLOSURE
          Optional property that controls whether and when the RDFS(+) closure is maintained on the database as documents are loaded (default: DEFAULT_CLOSURE).
static String COMMIT
          Optional property specifying whether and when the DataLoader will ITripleStore.commit() the database (default: DEFAULT_COMMIT).
static String DEFAULT_BUFFER_CAPACITY
           
static String DEFAULT_CLOSURE
           
static String DEFAULT_COMMIT
           
static String DEFAULT_FLUSH
          The default value (true) for FLUSH.
static String FLUSH
          When true the StatementBuffer is flushed by each DataLoader.loadData(String, String, RDFFormat) or DataLoader.loadData(String[], String[], RDFFormat[]) operation and when DataLoader.doClosure() is requested.
 
Fields inherited from interface com.bigdata.rdf.rio.RDFParserOptions.Options
DATATYPE_HANDLING, DEFAULT_DATATYPE_HANDLING, DEFAULT_PRESERVE_BNODE_IDS, DEFAULT_STOP_AT_FIRST_ERROR, DEFAULT_VERIFY_DATA, PRESERVE_BNODE_IDS, STOP_AT_FIRST_ERROR, VERIFY_DATA
 

Field Detail

COMMIT

static final String COMMIT
Optional property specifying whether and when the DataLoader will ITripleStore.commit() the database (default: DEFAULT_COMMIT).

Note: commit semantics vary depending on the specific backing store. See ITripleStore.commit().


DEFAULT_COMMIT

static final String DEFAULT_COMMIT

BUFFER_CAPACITY

static final String BUFFER_CAPACITY
Optional property specifying the capacity of the StatementBuffer (default is "100000" statements).
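
The following is a minimal sketch of how a caller might resolve this property from a Properties object, falling back to the documented default of "100000". It is written against a stand-in class so that it compiles without the bigdata jar. The key string "DataLoader.bufferCapacity" is a hypothetical placeholder, not the real value of the constant; real code should reference DataLoader.Options.BUFFER_CAPACITY and DataLoader.Options.DEFAULT_BUFFER_CAPACITY directly.

```java
import java.util.Properties;

// Stand-in for DataLoader.Options; NOT the bigdata class.
final class BufferCapacitySketch {

    // ASSUMPTION: hypothetical key string, used only for illustration.
    static final String BUFFER_CAPACITY = "DataLoader.bufferCapacity";

    // Default documented above: "100000" statements.
    static final String DEFAULT_BUFFER_CAPACITY = "100000";

    // Returns the configured capacity, or the default when the property is absent.
    static int bufferCapacity(Properties properties) {
        String value = properties.getProperty(BUFFER_CAPACITY, DEFAULT_BUFFER_CAPACITY);
        return Integer.parseInt(value.trim());
    }
}
```

With an empty Properties object the method yields the default capacity. After properties.setProperty(BufferCapacitySketch.BUFFER_CAPACITY, "20000") it yields 20000.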


DEFAULT_BUFFER_CAPACITY

static final String DEFAULT_BUFFER_CAPACITY
See Also:
Constant Field Values

CLOSURE

static final String CLOSURE
Optional property that controls whether and when the RDFS(+) closure is maintained on the database as documents are loaded (default: DEFAULT_CLOSURE).

Note: The InferenceEngine supports a variety of options. When closure is enabled, the caller's Properties will be used to configure an InferenceEngine object to compute the entailments. It is VITAL that the InferenceEngine is always configured in the same manner for a given database with regard to options that control which entailments are computed using forward chaining and which entailments are computed using backward chaining.

Note: When closure is being maintained the caller's Properties will also be used to provision the TempTripleStore.

See Also:
InferenceEngine, InferenceEngine.Options

DEFAULT_CLOSURE

static final String DEFAULT_CLOSURE

FLUSH

static final String FLUSH
When true the StatementBuffer is flushed by each DataLoader.loadData(String, String, RDFFormat) or DataLoader.loadData(String[], String[], RDFFormat[]) operation and when DataLoader.doClosure() is requested. When false the caller is responsible for flushing the DataLoader.buffer. The default is "true".

This behavior MAY be disabled if you want to chain-load many small documents without flushing to the backing store after each one and DataLoader.loadData(String[], String[], RDFFormat[]) is not well suited to your purposes. Doing so can be much more efficient, approximating the throughput of large document loads. However, the caller MUST invoke DataLoader.endSource() (or DataLoader.doClosure(), if appropriate) once all documents have been loaded successfully. If an error occurs while processing any document, the entire data load should be discarded; this holds regardless of the FLUSH setting.

This feature is most useful when blank nodes are not in use. When blank nodes are in use, disabling the flush causes memory to grow, and when statement identifiers are also enabled, statements that use blank nodes are deferred until the application flushes the DataLoader.
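
The difference between the two FLUSH settings can be illustrated with a toy model. The sketch below is not the DataLoader implementation: FlushSketch, its fields, and its counters are hypothetical names, statements are plain strings, and buffer capacity is ignored. It only mirrors the contract described above: with flush enabled, each loadData call writes through to the store, and with flush disabled nothing reaches the store until the caller invokes endSource().

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of the FLUSH contract; NOT the bigdata DataLoader.
final class FlushSketch {

    private final boolean flush;                          // models the FLUSH option
    private final List<String> buffer = new ArrayList<>(); // models the StatementBuffer
    private final List<String> store = new ArrayList<>();  // models the backing store
    private int flushCount = 0;

    FlushSketch(boolean flush) {
        this.flush = flush;
    }

    // Buffers one document's statements; flushes immediately only when flush is true.
    void loadData(List<String> statements) {
        buffer.addAll(statements);
        if (flush) {
            flushBuffer();
        }
    }

    // Must be called by the caller after the last document when flush is false.
    void endSource() {
        flushBuffer();
    }

    // Moves any buffered statements to the store and counts the write.
    private void flushBuffer() {
        if (buffer.isEmpty()) {
            return;
        }
        store.addAll(buffer);
        buffer.clear();
        flushCount++;
    }

    int flushCount() {
        return flushCount;
    }

    int storedCount() {
        return store.size();
    }
}
```

In this model, loading two documents with flush enabled produces two writes to the store. Loading the same two documents with flush disabled produces none until endSource() is called, at which point a single write carries all buffered statements.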


DEFAULT_FLUSH

static final String DEFAULT_FLUSH
The default value (true) for FLUSH.

See Also:
Constant Field Values


Copyright © 2006-2011 SYSTAP, LLC. All Rights Reserved.