https://github.com/practical-data-science/pandasxml
PandasXML is a Python package that imports data from remote XML files into Pandas dataframes, so they can be manipulated or exported to other file formats, such as CSV.
https://github.com/practical-data-science/pandasxml
pandas xml
Last synced: 24 days ago
JSON representation
PandasXML is a Python package that imports data from remote XML files into Pandas dataframes, so they can be manipulated or exported to other file formats, such as CSV.
- Host: GitHub
- URL: https://github.com/practical-data-science/pandasxml
- Owner: practical-data-science
- License: mit
- Created: 2022-03-17T09:13:33.000Z (about 4 years ago)
- Default Branch: master
- Last Pushed: 2022-03-17T17:09:46.000Z (about 4 years ago)
- Last Synced: 2023-03-05T19:39:28.262Z (over 3 years ago)
- Topics: pandas, xml
- Language: Python
- Homepage:
- Size: 9.77 KB
- Stars: 3
- Watchers: 1
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
README
# PandasXML
PandasXML is a Python package that imports data from remote XML files into Pandas dataframes, so they can be manipulated or exported to other file formats, such as CSV.
### Installation
You can install PandasXML by running the following command in your terminal:
```commandline
pip3 install git+https://github.com/practical-data-science/PandasXML.git
```
### Usage
Install the package and then import `pandasxml` as `px`. Pass the URL of the remote XML file to `px.read_xml()` to read the data into a Pandas dataframe.
By default, the `read_xml()` function expects elements to be named `item`, but you can define the element name using the `element_name` parameter.
```python
import pandasxml as px
df = px.read_xml('https://practicaldatascience.co.uk/sitemap.xml', element_name='url')
df.head()
```