Status: effective
Progress: 100%
Version: 3.2.0+
This help page explains how contents are imported with the help of the "Importer" feature, provided since Semantic MediaWiki 2.5.0 (released on 14 March 2017 and compatible with MW 1.23.0 - 1.29.x).[1] In short, the import is done upon installation of Semantic MediaWiki from legitimate files located in the directories specified with configuration parameter $smwgImportFileDirs. These files are available in "JSON" format[1], "XML" format[2] or as text files[2] and must contain the version specified with configuration parameter $smwgImportReqVersion. After every import, maintenance script "setupStore.php" (which sets up the data backend/store) has to be run.
In case the contents import was skipped using the --skip-import option on either maintenance script "setupStore.php" or maintenance script "update.php" (MediaWiki.org), the import process can be triggered manually with maintenance script "runImport.php" (which imports content from import files).[3]
The objective of the Importer is to provide a simple mechanism for deploying data structures and support information in a loose yet structured form during the installation (setup) process.
Import definitions
$smwgImportFileDirs defines the import directories from which content can be imported.
Import definitions are written in a JSON format, which provides the structural means and is considered easily extendable by end-users.
The import files are sorted by file name and processed sequentially in that order. Where content relies on other content, an appropriate naming convention should be followed to ensure that required definitions are imported in the expected order.
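The effect of a naming convention on the processing order can be illustrated with a minimal sketch. The file names other than smw.vocab.json are hypothetical, and the sketch assumes that files are ordered by a plain string comparison of their names, which this page does not spell out.

```php
<?php
// Hypothetical file names; only smw.vocab.json is mentioned on this page.
$files = [
    'smw.vocab.json',
    '20-templates.json',
    '10-properties.json',
];

// Assumption: the order is a plain string comparison of the file names.
sort( $files, SORT_STRING );

// Numeric prefixes make the intended order explicit: definitions in
// 10-properties.json are listed before those in 20-templates.json.
foreach ( $files as $position => $file ) {
    echo ( $position + 1 ) . ". $file\n";
}
```

Under that assumption, a numeric prefix is a simple way to make sure that properties are imported before the templates that use them.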
Semantic MediaWiki deploys preselected import content which is defined in the "smw.vocab.json" file and includes:
- "Smw import skos"
- "Smw import owl"
- "Smw import foaf"
- "Foaf:knows"
- "Foaf:name" and
- "Foaf:homepage"
It should be noted that smw.vocab.json is not expected to be the authoritative source of content for a wiki, which is why the option replaceable is set to false, so that pre-existing content that matches the same name and namespace is not replaced by the importer.
Custom definitions
It is possible to define one or more custom import definitions using $smwgImportFileDirs with a custom location (directory) from where import definitions can be loaded.
$GLOBALS['smwgImportFileDirs']['custom-vocab'] = __DIR__ . '/custom';
Fields
JSON schema and fields:
- description: short description of the purpose of the import (used in the auto summary)
- page: the name of a page without a namespace prefix
- namespace: literal constant of the namespace of the content (e.g. NS_MAIN, SMW_NS_PROPERTY ...)
- contents: contains either the raw text or a parameter
  - importFrom: link to a file from which the raw text is read (contains a path relative to the $smwgImportFileDirs directory)
- options
  - replaceable: indicates whether content is allowed to be replaced during an import or not and can take true, false, or { "LAST_EDITOR": "IS_IMPORTER" } to support a replacement when the last editor is the same as the import creator (this provides a method to extend content as long as the source page wasn't altered by someone or something else).
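The three values of replaceable can be read as a small decision rule. The following is a minimal sketch of that rule, not Semantic MediaWiki's actual implementation: the function mayWritePage, its parameters and the user names are hypothetical, and it assumes that a page which does not exist yet is always created.

```php
<?php
/**
 * Hypothetical helper (not part of Semantic MediaWiki) that mirrors the
 * documented meaning of the "replaceable" option.
 *
 * @param bool|array $replaceable value of options.replaceable
 * @param bool $pageExists whether the target page already exists
 * @param string|null $lastEditor user name of the page's last editor
 * @param string $importer user name under which the import runs
 */
function mayWritePage( $replaceable, bool $pageExists, ?string $lastEditor, string $importer ): bool {
    // Assumption: a page that does not exist yet is always created.
    if ( !$pageExists ) {
        return true;
    }

    // true or false: replace unconditionally, or never.
    if ( is_bool( $replaceable ) ) {
        return $replaceable;
    }

    // { "LAST_EDITOR": "IS_IMPORTER" }: replace only while nobody else
    // has edited the page since the last import.
    if ( is_array( $replaceable ) && ( $replaceable['LAST_EDITOR'] ?? null ) === 'IS_IMPORTER' ) {
        return $lastEditor === $importer;
    }

    // Anything else is treated like false in this sketch.
    return false;
}

$option = json_decode( '{ "LAST_EDITOR": "IS_IMPORTER" }', true );
var_dump( mayWritePage( $option, true, 'Importer', 'Importer' ) );
var_dump( mayWritePage( $option, true, 'Alice', 'Importer' ) );
```

The conditional form is useful for content that should follow later versions of the import file until someone edits the page by hand.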
$smwgImportReqVersion stipulates the required version for an import; only definitions that match that version are permitted to be imported.
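The version check can be pictured as a comparison between the configured value and the meta.version field of a definition file. This is a hedged sketch only: matchesRequiredVersion is a hypothetical function, and both the string-based comparison and the rejection of files without a version are assumptions rather than documented behaviour.

```php
<?php
/**
 * Hypothetical check, not Semantic MediaWiki's actual code.
 *
 * Assumptions: a definition without meta.version is rejected, and versions
 * are compared as strings so that "1" in JSON matches an integer 1 setting.
 */
function matchesRequiredVersion( array $definition, $requiredVersion ): bool {
    if ( !isset( $definition['meta']['version'] ) ) {
        return false;
    }

    return (string)$definition['meta']['version'] === (string)$requiredVersion;
}

$definition = json_decode(
    '{ "description": "Custom import", "import": [], "meta": { "version": "1" } }',
    true
);

var_dump( matchesRequiredVersion( $definition, 1 ) );
var_dump( matchesRequiredVersion( $definition, 2 ) );
```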
Examples
XML import
It is possible to use MediaWiki's XML format as import source when linked from the importFrom field (any non-MediaWiki XML format will be ignored).
The location of the mentioned custom.xml is relative to the selected $smwgImportFileDirs directory.
{
    "description": "Custom import",
    "import": [
        {
            "description" : "Import of custom.xml that contains ...",
            "contents": {
                "importFrom": "/xml/custom.xml"
            }
        }
    ],
    "meta": {
        "version": "1"
    }
}
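How the importFrom value relates to the import directory can be sketched as a simple path join. The helper resolveImportFrom is hypothetical and only illustrates the "relative to the directory" rule stated above; the directory __DIR__ . '/custom' is taken from the custom definitions example.

```php
<?php
/**
 * Hypothetical helper (not part of Semantic MediaWiki): joins an import
 * directory and an importFrom value into one file path.
 */
function resolveImportFrom( string $importDir, string $importFrom ): string {
    // Normalise the slashes so exactly one separates both parts.
    return rtrim( $importDir, '/' ) . '/' . ltrim( $importFrom, '/' );
}

// With the directory from the custom definitions example:
echo resolveImportFrom( __DIR__ . '/custom', '/xml/custom.xml' ), "\n";
```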
Text import
{
    "description": "Template import",
    "import": [
        {
            "description" : "Template to ...",
            "page": "Template_1",
            "namespace": "NS_TEMPLATE",
            "contents": "{{{1}}}, {{{2}}}",
            "options": {
                "replaceable": false
            }
        },
        {
            "description" : "Template with ...",
            "page": "Template_2",
            "namespace": "NS_TEMPLATE",
            "contents": {
                "importFrom": "/templates/template-1.tmpl"
            },
            "options": {
                "replaceable": false
            }
        }
    ],
    "meta": {
        "version": "1"
    }
}
Import process
During the setup process, the Installer runs the import automatically and reports on its progress, producing output similar to:
Import of smw.vocab.json ...
   ... replacing MediaWiki:Smw import foaf contents ...
   ... skipping Property:Foaf:knows, already exists ...
Import processing completed.
Unless otherwise specified, pre-existing content (a.k.a. pages) is skipped by default.
Technical notes
SMW::SQLStore::Installer::AfterCreateTablesComplete is the event used to import content during the setup.
- src
  - Importer
    - ImporterServiceFactory: access to import services
    - Importer: is responsible for importing contents provided by a ContentIterator
    - ContentIterator: an interface to provide access to individual ImportContents instances
    - JsonContentIterator: implements the ContentIterator interface
    - JsonImportContentsFileDirReader: provides contents of all recursively fetched files from a location (e.g. the $smwgImportFileDirs setting) that meet the requirements
    - ContentModeller: interprets the JSON definition and returns a set of ImportContents instances
    - ContentCreator: an interface to specify different creation methods (e.g. text, XML etc.)
    - ContentCreators
      - DispatchingContentCreator: dispatches to the actual content creation instance based on ImportContents::getContentType
      - XmlContentCreator: supports the creation of MediaWiki XML specific content
      - TextContentCreator: supports raw wikitext
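The dispatching idea behind the ContentCreators can be sketched with simplified stand-ins. None of the Sketch* classes below exist in Semantic MediaWiki, the method names and the content type strings 'text' and 'xml' are assumptions, and the 'type' array key merely stands in for ImportContents::getContentType.

```php
<?php
// Simplified stand-ins for illustration only; the real classes in
// Semantic MediaWiki have different constructors and signatures.
interface SketchContentCreator {
    public function supports( string $contentType ): bool;
    public function create( array $contents ): string;
}

final class SketchTextCreator implements SketchContentCreator {
    public function supports( string $contentType ): bool {
        return $contentType === 'text';
    }
    public function create( array $contents ): string {
        return 'text page: ' . $contents['page'];
    }
}

final class SketchXmlCreator implements SketchContentCreator {
    public function supports( string $contentType ): bool {
        return $contentType === 'xml';
    }
    public function create( array $contents ): string {
        return 'xml file: ' . $contents['importFrom'];
    }
}

final class SketchDispatchingCreator implements SketchContentCreator {
    /** @var SketchContentCreator[] */
    private $creators;

    public function __construct( array $creators ) {
        $this->creators = $creators;
    }

    public function supports( string $contentType ): bool {
        foreach ( $this->creators as $creator ) {
            if ( $creator->supports( $contentType ) ) {
                return true;
            }
        }
        return false;
    }

    // Hands the contents to the first creator that supports their type.
    public function create( array $contents ): string {
        foreach ( $this->creators as $creator ) {
            if ( $creator->supports( $contents['type'] ) ) {
                return $creator->create( $contents );
            }
        }
        throw new RuntimeException( 'No creator for type ' . $contents['type'] );
    }
}

$dispatcher = new SketchDispatchingCreator( [ new SketchTextCreator(), new SketchXmlCreator() ] );
echo $dispatcher->create( [ 'type' => 'text', 'page' => 'Template_1' ] ), "\n";
echo $dispatcher->create( [ 'type' => 'xml', 'importFrom' => '/xml/custom.xml' ] ), "\n";
```

Because the dispatcher implements the same interface as the creators it wraps, calling code does not need to know which creation method applies to a given piece of content.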
See also
- Help page on importing vocabulary
- Help page on configuration parameter $smwgImportFileDirs (sets the contents import directory for legitimate files to be imported upon installation of Semantic MediaWiki)
- Help page on configuration parameter $smwgImportReqVersion (sets the contents import file version for legitimate files to be imported upon installation of Semantic MediaWiki)
- Help page on maintenance script "runImport.php" (allows content to be imported from import files)
References
1. Semantic MediaWiki: GitHub pull request gh:smw:2289
2. Semantic MediaWiki: GitHub pull request gh:smw:2386
3. Semantic MediaWiki: GitHub pull request gh:smw:4466