Indexing
Disambiguation[edit source]
Entities keyworded Indexing[edit source]
Indexing Terms | Mentioned In |
---|
Indexing Domains:
Indexing involves 2 steps:
- Feeding resources from a ResourceSilo into a Datastore
- Applying an Indexer to resources in a Datastore
Feeding resources from a ResourceSilo into a Datastore[edit | edit source]
See SystemComponent "datastores"
AAAA-Ingestion "From E-Mail account to Datastore by"
Data fed? | |
Configuration | |
Subsequent pipeline | |
Tools | |
AAAA-Ingestion "From File system to Datastore by https://github.com/dataspects/dsdocument-cli"
Data fed? | |
Configuration |
Configure the datastore: <Datastore ID> automatic (e.g. 12) <Datastore Label> Shown in information sources list on https://ui.dataspects.com/search <Datastore API Key> automatic (e.g. c8b89bc3-0139-11wa-8ef3-8c164563716b) <Datastore Doc Regex> Only file names matching this regex will be fed to the datastore (Regex Tester - Golang) Configure and run the feeder to index matching files in and below the current folder: user@workstation:/yourfolder$ ./dsdocument-cli \ --id <Datastore ID> # From https://ui.dataspects.com/datastores/files \ --url https://datastore.dataspects.com \ --key <Datastore API Key> # From https://ui.dataspects.com/datastores/files |
Subsequent pipeline | |
Tools | |
Data fed? | |
Configuration |
https://github.com/dataspects/DataspectsMediaWikiFeeder/blob/master/maintenance/feedAll.php |
Subsequent pipeline | |
Tools | |
AAAA-Ingestion "From MediaWiki to Datastore by https://github.com/dataspects/mediawiki-workbench"
Data fed? | |
Configuration | |
Subsequent pipeline | |
Tools | |
AAAA-Ingestion "From Repository to dataspectsd by"
Data fed? | |
Configuration |
repository-id: "ping"
regex: ".*.go$"
directoryToIndex:
elastic-search:
host: http://192.168.1.36
port: 9200
username:
password:
neo4j:
host: bolt://192.168.1.36
port: 7687
username:
password:
|
Subsequent pipeline | |
Tools | |