Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vladimiralexiev/ontotext-graphdb-applications
Applications of Ontotext GraphDB: ENDORSE 2021 presentation, Zotero bibliography, Wordcloud
https://github.com/vladimiralexiev/ontotext-graphdb-applications
graphdb semantic-web
Last synced: 2 days ago
JSON representation
Applications of Ontotext GraphDB: ENDORSE 2021 presentation, Zotero bibliography, Wordcloud
- Host: GitHub
- URL: https://github.com/vladimiralexiev/ontotext-graphdb-applications
- Owner: VladimirAlexiev
- Created: 2018-12-27T13:12:48.000Z (almost 6 years ago)
- Default Branch: master
- Last Pushed: 2021-03-18T08:50:41.000Z (almost 4 years ago)
- Last Synced: 2024-11-10T09:29:28.514Z (about 1 month ago)
- Topics: graphdb, semantic-web
- Homepage:
- Size: 17.5 MB
- Stars: 2
- Watchers: 4
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.org
Awesome Lists containing this project
README
* Table of Contents :TOC:noexport:
:PROPERTIES:
:TOC: :include all
:END::CONTENTS:
- [[#diverse-uses-of-ontotext-graphdb][Diverse Uses of Ontotext GraphDB]]
- [[#graphdb-on-wikipedia][GraphDB on Wikipedia]]
- [[#endorse-conference][ENDORSE Conference]]
- [[#zotero-bibliography][Zotero Bibliography]]
- [[#sources][Sources]]
- [[#wordcloud][WordCloud]]
- [[#topics][Topics]]
:END:* Diverse Uses of Ontotext GraphDB
[[http://graphdb.ontotext.com][Ontotext GraphDB]] has a Free edition that only has a limit on the number of concurrent queries.
People tell us that they like the developer and user-friendliness of the database, especially the GraphDB Workbench.
As a result, it's used widely in academic research.GraphDB Free collects usage statistics (you can opt out of that), but I don't believe we have analyzed it systematically.
Another way to analyze where it is used is to research the literature.** GraphDB on Wikipedia
I started a Wikipedia table entry about GraphDB:
https://en.wikipedia.org/wiki/Talk:Graph_database#Ontotext_GraphDB.
As part of that, in Dec 2018 I did some analysis and saw that is used in a wide variety of domains, eg:
social media analytics, data marketplaces, engineering, life sciences, medicine, chemistry, art history vocabularies, Holocaust research, etc.** ENDORSE Conference
A version of this work was presented as:
- Vladimir Alexiev, [[https://github.com/VladimirAlexiev/ontotext-graphdb-applications/raw/master/Diverse%2520Uses%2520of%2520a%2520Semantic%2520Graph%2520Database%2520for%2520Knowledge%2520Organization%2520and%2520Research%2520(ENDORSE%25202021).pptx][Diverse Uses of a Semantic Graph Database for Knowledge Organization and Research]],
European Data Conference on Reference Data and Semantics (ENDORSE 2021), 16-19 March 2021.
Organized by the EU Publications Office and the ISA2 Programme
- Abstract: Semantic Graph Databases are the foundation of Enterprise Knowledge Graphs. They are used in numerous industrial applications, but also Knowledge Organization Management systems (thesaurus and ontology management systems), such as VocBench, SWC PoolParty, Synaptica Semaphore. Through VocBench, semantic databases manage or publish some of the most important thesauri: EuroVoc, AgroVoc, the Getty Vocabularies, etc. Semantic databases are also used in a wide variety of research domains and projects. Some have open source or free editions that make them an easy choice for academic research. We searched on Google Scholar and found 1000-1200 academic papers and theses mentioning one of the popular databases. We also found at least 50 books on Google Books that mention it. We started a Zotero bibliography on the topic (currently about 150 papers), and captured about 220 research topics, based on the titles of about 250 papers. We will present an analysis of reference data and research domains using a semantic database. Some of the traditional topics include: social media analytics, data marketplaces, business process management, enterprise data integration, statistical data, engineering, smart cities, sensor networks, life sciences, biomedical ontologies, medicines, chemistry, linguistic data, semantic publishing, semantic text analysis, geographic information, master data management. Newer or more exotic topics include academic/research data, COVID and Zika viruses, Quran and bilingual Arabic-English data, art history, Holocaust research, musical events and adaptations, iconography, food and drink, tourism, investment decision support, economic research, offshore leaks, maritime data, construction projects, building information management, crisis management, critical incidents and infrastructures, data journalism, clinical trials and specific medical topics (e.g. intestinal cells, intracoronal tooth restorations, vaccines, toxicology), investment recommendations, data journalism, etc.[[./img/Diverse Uses of a Semantic Graph Database for Knowledge Organization and Research (ENDORSE 2021).png]]
* Zotero Bibliography
I started a Zotero bibliography [[https://www.zotero.org/groups/2744757/ontotext-graphdb][Ontotext GraphDB]] that as of 1-Mar-2021 includes [[https://www.zotero.org/groups/2744757/ontotext-graphdb/library][175 papers]] mentioning GraphDB:
[[./img/zotero-graphdb.png]]
It includes the following categories:
- About GraphDB (16): papers/blogs specifically about GraphDB, by Ontotext staff or others
- Benchmarks (13): various benchmarking efforts where GraphDB has been evaluated amongst other semantic databases
- Business Process Management (2): applications of semantic technology to BPM
- Cultural Heritage (11): cultural heritage and digital humanities
- Economics, Business Data, Tourism (11): tourism, hotels, investment decision support, company data, industry classification
- Education, Science KGs (5): science data, academic management, etc
- Geospatial (5): geospatial data and benchmarks
- Industrial Data, Manufacturing (8): industrial standards, sensors, building automation
- Knowledge Graphs (6): KG techniques
- Law, LegalTech (2): legal texts and knowledge bases
- Life Sciences, Bio Informatics (19): genes, organisms, biomedical ontologies, semantic search
- Market Assessments (3): the semantic database market
- Querying, Visualization (6): SPARQL optimization, query writing, NLP querying, visualization
- Semantic Text Analysis, Lexicography, Semantic Publishing, Social Networks (18)
- Smart Cities (3): city infrastructure, cyber-physical systems
- Statistical, Regional Data (3)
- Storage, Reasoning, Graph Databases (22): storage techniques, reasoning, rules
- Thesaurus, Ontology Management, Conceptual Modeling, Classification (11)
- Validation, RDF Shapes, Schemas (2): SHACL implementation benchmarks, APIs from shapesIf you have some other information needs or ideas, post an issue or a pull request!
** Sources
Here are some sources of references, refreshed as of Dec 2019:
- [[https://scholar.google.bg/scholar?q=ontotext+graphdb+OR+owlim][Google Scholar]] (search for "ontotext graphdb OR owlim"): about 1000-1200 academic papers that mention it (research articles, theses, etc).
I checked the [[https://scholar.google.bg/scholar?start=990&q=ontotext+graphdb+OR+owlim][page starting at result 990]] and although most of those 10 papers are not in English (I hit Spanish, German, French and Catalonian),
I was able to verify that all of them mention it.
There is some duplication, but even assuming this represents 700 unique academic papers, that's still very impressive.
- [[https://www.google.com/search?q=ontotext+graphdb+OR+owlim&tbm=bks][Google Books]] (search for "ontotext graphdb OR owlim"): it's hard to say how many of these hits actually mention it since Google Books search uses some associative algorithm.
But I checked the first 5 pages and it seems most of those 50 books mention it.
- [[https://news.google.com/search?q=ontotext][Google News]] (search for "Ontotext", not Ontotext GraphDB): this is harder to judge, because:
- It includes many company press releases
- It seems to index not only the text of news items, but also header/footer text.
Eg the news item [[https://commercegazette.com/2019/12/09/automatic-identification-and-data-capture-aidc-market-worth-100-5-billion-by-2025-datalogic-honeywell-zebra-technologies-sick-cognex-sato-denso-wave-cipherlab-2/][Automatic Identification and Data Capture (AIDC) Market worth $100.5 billion by 2025-Datalogic, Honeywell, Zebra Technologies, Sick, Cognex, Sato, Denso Wave, Cipherlab]]
is not related to Ontotext, except that it mentions the item
[[https://commercegazette.com/2019/12/09/semantic-knowledge-discovery-software-market-research-2019-key-players-maanachemaxonexpert-system-spaontotextcollexis-elseviercambridge-semanticsnervana-intel/][Semantic Knowledge Discovery Software Market Research 2019: Key Players- MAANA,ChemAxon,Expert System SpA,OntoText,Collexis (Elsevier),Cambridge Semantics,Nervana (intel)]] at the bottom.* WordCloud
Here is a word-cloud of applications of Ontotext GraphDB in academic papers, based on the Topics shown in the next section
- Save the extracted topics (see next section) to [[./graphdb-wordCloud.txt]]
- Replace spaces with "~" to make the tool treat phrases not words
- I used [[http://WordClouds.com][WordClouds.com]] to generate this cloud
- Configuration saved as: [[./graphdb-wordCloud.wcld]] (CloudConfig v3.5.3)
- Use ~File> Open Text File~ and load the above file. It represents the following settings:
- ~File> Paste/Type text~: paste [[./graphdb-wordCloud.txt]]
- Size: 800x800
- Gap size: 2
- Direction of words: uphill/horizontal/downhill
- Shape> shapes: christmas tree
- Mask: checked
- Theme: 4-th one with 6 colors
- Colors> background> Color: select a dark green
- Font: Verdana
- Font size (slider above the wordlet): 51 (to fill out the tree shape in a good way)
- Saved as ontotext-graphdb-applications.png (800x800)[[./img/ontotext-graphdb-applications.png]]
- Saved as ontotext-graphdb-applications-HD.png (3200x3200, 2Mb)
- Saved as ontotext-graphdb-applications.svg: Please note that this is a bit inferior "When saving as SVG, masks and backgrounds are not visible".
[[./img/ontotext-graphdb-applications.svg]]
** Topics
Here are 230 research topics (phrases) that I've extracted from the first 220 or so papers on Google Scholar (only looking at article titles, not abstracts):- Bulgarian icons
- Al-Quran
- COVID-19
- EC Publications Office
- EUCases
- Europeana
- FrameNet
- GDPR compliance
- GIS
- GeoSPARQL
- GeoSPARQL+
- Geographica
- Getty vocabularies
- Holocaust research
- ISO 25964
- ISOcat
- IoT systems
- NLP Interchange Format
- OceanGraph
- Open Street Map
- OpenBiodiv
- Orthodox icons
- Panama papers
- SHACL implementation
- Semalytics
- TheyBuyForYou
- XO laptop
- Zika virus
- academic management
- adaptive semantic publishing
- air traffic management
- ambient assisted living
- amplicon analysis
- art research vocabularies
- audiovisual
- bibliographic references
- bibliographic repositories
- big data
- bilingual Arabic-English
- biodiversity
- biogeographic data
- biomedical metadata
- biomedical research
- biorefineries
- building construction
- building information management
- building maintenance
- business process management
- cancer data
- chemical contaminants
- city-wide infrastructure
- clinical assessment
- clinical autonomy assistance
- clinical trials
- cloud infrastructure code
- company information
- conceptual modeling
- construction projects
- consumer data
- context configuration
- contextualized knowledge
- conversational assistants
- crisis management
- critical incidents
- critical infrastructures
- crowdsourcing
- cultural heritage
- curation systems
- custom rules
- cyber-physical social system
- cyber-physical system
- cyber-physical topology
- data as-a-service
- data journalism
- data marketplaces
- data science pipelines
- data-as-a-service
- decision support system
- defense analysis
- depression
- dialog generation
- dietary supplements
- digital mapping
- digital repositories
- disaster response
- disputed information
- district energy
- diversity-aware ranking
- earth observation
- educational metadata
- electronic health records
- electrophysiological experiments
- engineering services
- enterprise data fabric
- environmental health
- environmental legislation
- euBusinessGraph
- event recognition
- fact mining
- factory planning
- food and agriculture
- food and drink gazetteer
- football information
- geneology
- geolocation
- geospatial
- glycobiology
- graph data
- health care evidence
- health data analytics
- historical texts
- hotel data
- human nervous system
- hybrid knowledge bases
- industrial IoT
- industrial machine sensors
- industrial megaprojects
- infrastructure projects
- integrated querying
- interdisciplinarity
- internet of things
- intestinal cells
- intracoronal tooth restorations
- investment decision support
- investment recommendation
- journalism
- journalist workbench
- knowledge graph
- knowledge-based coordination
- land cover
- language learning
- legal cases
- legal linked open data
- library linked data
- licensing understanding
- life cycle construction information
- life sciences
- linear city infrastructure
- linguistic data
- linked data management
- literature-extracted data
- machine maintenance
- maritime domain
- mathematical expressions
- media monitoring
- medical decision support
- medical knowledge discovery
- medical texts
- metadata management
- metamodeling
- mind mapping
- model-driven
- molecule databases
- multi-disciplinary domains
- multimedia
- multimodal transport
- museums
- music events
- musical adaptations
- natural hazards
- neuroinformatics
- obesity
- offshore leaks
- offshore tax-sheltered firms
- online newspapers
- ontology inference
- ontology storage
- ontology-based classification
- open innovation
- patient data
- photographic archives
- plant phenotyping
- protein function prediction
- public events
- public procurement
- public service descriptions
- ranking computations
- raster data
- region-based data
- rights enforcement
- risk factors identification
- safe food consumption
- schema.org
- scientific publications
- security knowledge
- semantic publishing
- semantic subscription
- sensor measurements
- sensor networks
- sensor stream processing
- single-subject design studies
- smart cities
- software architecture representation
- software engineering
- spatial data infrastructures
- spatio-temporal
- sports statistics
- statistical data
- tensor-based triple store
- territorial partitions
- territorial units
- text analytics
- thesauri
- topic modeling
- toxicology
- traffic data analysis
- translational research
- transport interoperability
- trigonometry
- uncertainty handling
- vaccines
- vector data
- vector-space model