https://github.com/johannesbuchner/flight-reservation-emails
Searches your emails for flight tickets & displays a summary with all flight details
email-parsing flights mining
Last synced: 3 months ago
- Host: GitHub
- URL: https://github.com/johannesbuchner/flight-reservation-emails
- Owner: JohannesBuchner
- License: bsd-2-clause
- Created: 2017-04-28T12:34:30.000Z (over 8 years ago)
- Default Branch: master
- Last Pushed: 2018-12-02T18:54:00.000Z (almost 7 years ago)
- Last Synced: 2025-07-16T11:20:49.594Z (3 months ago)
- Topics: email-parsing, flights, mining
- Language: Perl
- Homepage:
- Size: 37.1 KB
- Stars: 39
- Watchers: 2
- Forks: 6
- Open Issues: 0
Metadata Files:
- Readme: README.rst
- License: LICENSE
README
Flight & Hotel reservation email parser
========================================

Searches emails for flight tickets and hotel reservations.
Builds a brief summary view of all your reservations over time.

--------
Usage
--------

1. Adding emails to the email database

   People store emails in various ways.
   Here we support the notmuch database (https://notmuchmail.org/).
   It is trivial to include maildir emails into notmuch with "notmuch new".

   For email programs with mailbox, mb2md (http://batleth.sapienti-sat.org/projects/mb2md/) can be run to convert to maildir, followed by "notmuch new".
   For Thunderbird, the thunderbird-notmuch-import.py script is provided,
   which finds the relevant folders automatically.
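
The mailbox-to-maildir conversion in step 1 can also be sketched with Python's standard ``mailbox`` module. This is a stand-in for mb2md, not something the project ships; the function name ``mbox_to_maildir`` and both paths are hypothetical.

```python
import mailbox


def mbox_to_maildir(mbox_path, maildir_path):
    """Copy every message from an mbox file into a Maildir; return the count.

    Hypothetical helper: a stdlib substitute for mb2md, not part of the project.
    """
    source = mailbox.mbox(str(mbox_path), create=False)
    target = mailbox.Maildir(str(maildir_path), create=True)
    copied = 0
    try:
        for message in source:
            # Maildir.add() converts the mbox message to Maildir format.
            target.add(message)
            copied += 1
        target.flush()
    finally:
        source.close()
        target.close()
    return copied
```

The resulting directory has to sit under the mail root that notmuch is configured to index before "notmuch new" will pick the messages up.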

2. Building the report

   Run with some email search keywords::

       $ python summary.py 'schema.org/FlightReservation OR ticket OR flight OR flug OR viaje OR booking OR confirmation OR confirmacion'

   It will give you some idea of what it finds, for example::

       2015-11-28 Flight HOUSTON, TX --> WASHINGTON, DC

       Departing 2015-11-28 19:10
       from HOUSTON, TX
       arriving 2015-11-28 23:05
       To WASHINGTON, DC
       Flight number UA1955

3. View report

   For an example report see "summary.html"!
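
The search argument in step 2 is a single string of keywords joined with ``OR``. A minimal sketch of assembling it from a list and handing it to the script might look like this; ``build_query`` and ``run_summary`` are hypothetical names, and the only assumption about ``summary.py`` is the one-argument invocation shown above.

```python
import subprocess
import sys


def build_query(keywords):
    """Join search keywords into one OR-separated query string."""
    cleaned = (keyword.strip() for keyword in keywords)
    return " OR ".join(keyword for keyword in cleaned if keyword)


def run_summary(keywords, script="summary.py"):
    """Run summary.py with the combined query as its single argument.

    Hypothetical wrapper; assumes the script sits in the working directory.
    """
    command = [sys.executable, script, build_query(keywords)]
    return subprocess.run(command, check=True)
```

Calling ``run_summary(["schema.org/FlightReservation", "ticket", "flug"])`` would then be equivalent to typing the quoted query by hand, which makes it easier to keep keyword lists for several languages in one place.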

Features implemented
----------------------

* Summary of all flights, with crucial information (when, from-to, ...)
* Includes PDF e-ticket files extracted from the emails.
* Parallel parsing for speed-up.
* Parsing of the flight reservations schema following https://developers.google.com/gmail/markup/reference/flight-reservation
* Some heuristic parsing of HTML emails in English, Spanish and German.
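
To illustrate the schema-based parsing listed above, here is a minimal sketch that pulls ``FlightReservation`` JSON-LD out of an HTML email body. It is not the project's implementation: ``JsonLdCollector`` and ``extract_flights`` are hypothetical names, the field names follow the Gmail markup reference linked above, and microdata-style markup is not handled.

```python
import json
from html.parser import HTMLParser


class JsonLdCollector(HTMLParser):
    """Collect the text of every <script type="application/ld+json"> element."""

    def __init__(self):
        super().__init__()
        self._inside = False
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        if tag == "script" and dict(attrs).get("type") == "application/ld+json":
            self._inside = True
            self.blocks.append("")

    def handle_endtag(self, tag):
        if tag == "script":
            self._inside = False

    def handle_data(self, data):
        if self._inside:
            self.blocks[-1] += data


def extract_flights(html):
    """Return one summary dict per FlightReservation found in the HTML."""
    collector = JsonLdCollector()
    collector.feed(html)
    collector.close()
    flights = []
    for block in collector.blocks:
        try:
            payload = json.loads(block)
        except json.JSONDecodeError:
            continue  # skip malformed markup instead of failing the whole email
        items = payload if isinstance(payload, list) else [payload]
        for item in items:
            if not isinstance(item, dict) or item.get("@type") != "FlightReservation":
                continue
            flight = item.get("reservationFor", {})
            airline = flight.get("airline", {}).get("iataCode", "")
            flights.append({
                "number": airline + str(flight.get("flightNumber", "")),
                "from": flight.get("departureAirport", {}).get("name"),
                "departing": flight.get("departureTime"),
                "to": flight.get("arrivalAirport", {}).get("name"),
                "arriving": flight.get("arrivalTime"),
            })
    return flights
```

Emails without such markup simply yield an empty list, which is where the heuristic HTML parsing would have to take over.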

If you have emails that can serve as additional test cases, please submit
them. Contributions to the parsing are welcome!

To Do
------------

* More heuristic parsing rules.
* Implement hotel bookings (https://developers.google.com/gmail/markup/reference/hotel-reservation). Booking.com and some others produce the JSON version of this markup.
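
As a starting point for the hotel item, a parsed JSON reservation could be reduced to a summary along these lines. This is only a sketch of a feature that is not implemented: ``summarize_lodging`` is a hypothetical name, and the field names are taken from the hotel-reservation reference linked above.

```python
def summarize_lodging(item):
    """Return a short summary dict for a LodgingReservation, else None.

    Hypothetical helper; `item` is one already-decoded JSON object.
    """
    if not isinstance(item, dict) or item.get("@type") != "LodgingReservation":
        return None
    hotel = item.get("reservationFor", {})
    return {
        "hotel": hotel.get("name"),
        "checkin": item.get("checkinDate"),
        "checkout": item.get("checkoutDate"),
        "reservation": item.get("reservationNumber"),
    }
```

Missing fields come back as ``None`` rather than raising, so partially filled markup still produces a usable row in a report.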