Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/cadmiumkitty/rdfpandas
RdfPandas is a module providing RDF support for Pandas
https://github.com/cadmiumkitty/rdfpandas
linked-data pandas rdf rdflib semantic-web
Last synced: 2 months ago
JSON representation
RdfPandas is a module providing RDF support for Pandas
- Host: GitHub
- URL: https://github.com/cadmiumkitty/rdfpandas
- Owner: cadmiumkitty
- License: mit
- Created: 2018-05-07T11:52:30.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2024-09-21T00:10:28.000Z (4 months ago)
- Last Synced: 2024-09-21T11:50:41.588Z (4 months ago)
- Topics: linked-data, pandas, rdf, rdflib, semantic-web
- Language: Python
- Homepage:
- Size: 26.4 KB
- Stars: 51
- Watchers: 2
- Forks: 8
- Open Issues: 0
-
Metadata Files:
- Readme: README.rst
- License: LICENSE
Awesome Lists containing this project
- awesome-ccamel - cadmiumkitty/rdfpandas - RdfPandas is a module providing RDF support for Pandas (Python)
README
RdfPandas
=========RdfPandas is a module providing RDF support for Pandas. It consists of
two simple functions for Graph to DataFrame conversion and
DataFrame to Graph conversion.Getting Started
---------------For more information about Resource Description Framework (RDF) and Pandas see:
- RDF: https://www.w3.org/RDF/
- Pandas: https://pandas.pydata.org/Prerequisites
-------------You will need Python 3 to use Pandas and RdfPandas.
Installation
------------::
pip install rdfpandas
Usage
-----Creating RDF from DataFrame
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^As of version 1.1.0 NamespaceManager can be supplied to ``rdflib.to_graph`` for conversion to Graph.
::
from rdfpandas.graph import to_graph
import pandas as pd
import rdflib
df = pd.read_csv('to_graph_test.csv', index_col = '@id', keep_default_na = False)
namespace_manager = NamespaceManager(Graph())
namespace_manager.bind('skos', SKOS)
namespace_manager.bind('rdfpandas', Namespace('http://github.com/cadmiumkitty/rdfpandas/'))
g = to_graph(df, namespace_manager)
s = g.serialize(format = 'turtle')Creating DataFrame from RDF
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^::
from rdfpandas.graph import to_dataframe
import pandas as pd
import rdflib
g = rdflib.Graph()
g.parse('to_df_test.ttl', format = 'ttl')
df = to_dataframe(g)
df.to_csv('test.csv', index = True, index_label = "@id")Gotchas
-------No special effort is made for dealing with types, so please be aware of Pandas
features such as https://pandas.pydata.org/pandas-docs/stable/user_guide/gotchas.html#support-for-integer-na
that may result in surprising RDF statements like ``"10.0"^^``.