Semantic Assistants Wiki (SAW) In the Context of the ETC Project
SAW Intro Wiki – Users collaboratively develop and organize content – Information Analysis is up to the user Goal: – “Self-aware wiki that can develop and organize its content” – Support users in information analysis Requires NLP to handle majority of content Semantic Assistants Wiki: Integration of NLP with Wikis
SAW in Action WikiWiki-NLP IntegrationSemantic AssistantsGATE NLP Pipeline: Names Entity Recognition wiki.org/Mary “…Mary won…” … Mary won the first prize... … [[hasType::Person|Mary]] won… XML
Example: Biomedical Literature Curation GenWiki: Filled with full text research papers Entity Recognition: Enzymes, Organisms Additional semantic information – Systematic name – Link to enzyme database entry Wiki is enriched with additional data e.g. using Semantic MediaWiki markup [[hasType::Enzyme] Time required to curate research papers reduced
Example: Wiktionary Automatically populate the wiki using computational linguistics Cross-link between different language entries Manual work can be reduced
SAW for ETC Charaparser WikiWiki-NLP IntegrationSemantic Assistants etc-project.org/wiki/fna19 “…abaxial faces, without…” GATE NLP Pipeline: Charaparser “…abaxial faces, without…” … abaxial faces, without septate trichomes...
SAW for ETC Charaparser Faces * [[hasConstraint::Abaxial]] * [[without::Trichome]] Trichome * [[hasArchitecture::septate]] WikiWiki-NLP IntegrationSemantic Assistants etc-project.org/wiki/fna19/superstructure/faces Charaparser NLP Pipeline etc-project.org/wiki/fna19/superstructure/trichome XML
SAW for ETC Charaparser – GATE compliant NLP pipeline – Charaparser output to wiki markup translation module – Wiki as ‘User Interface’ of Charaparser Logic Reasoning, Information Theory, Ontology building can – Read and query charaparser results from wiki (RDF triplets) – Be possibly integrated in wiki
Open Questions Charaparser as GATE compliant pipeline; Effort? Alternatives to Semantic Assistant Wiki? Apache Stanbol? Representation of Structure, Character, Relation in Wiki (e.g. duplicate structure names)
References Bahar Sateli and René Witte. Natural Language Processing for MediaWiki: The Semantic Assistants Approach. WikiSym Bahar Sateli, Marie-Jean Meurs, Greg Butler, Justin Powlowski, Adrian Tsang, René Witte. IntelliGenWiki: An Intelligent Semantic Wiki for Life Sciences. NETTAB René Witte and Thomas Gitzinger. Connecting Wikis and Natural Language Processing Systems. WikiSym
Architecture