https://github.com/ericflo/pynzb

pynzb is a unified API for parsing NZB files, with several concrete implementations included
https://github.com/ericflo/pynzb

Last synced: 11 months ago
JSON representation

pynzb is a unified API for parsing NZB files, with several concrete implementations included

Host: GitHub
URL: https://github.com/ericflo/pynzb
Owner: ericflo
License: bsd-3-clause
Created: 2009-05-11T06:12:31.000Z (about 17 years ago)
Default Branch: master
Last Pushed: 2017-01-12T01:19:15.000Z (over 9 years ago)
Last Synced: 2025-06-20T14:06:38.051Z (about 1 year ago)
Language: Python
Homepage:
Size: 103 KB
Stars: 18
Watchers: 5
Forks: 6
Open Issues: 3
Metadata Files:
- Readme: README.rst
- License: LICENSE.txt

Awesome Lists containing this project

README

          Introduction

------------

NZB is an XML-based file format for retrieving posts from NNTP (Usenet) servers.

Since NZB is XML-based, it's relatively easy to build one-off parsers to parse

NZB files.  This project is an attempt to consolidate those many one-off NZB

parsers into one simple interface.

This package includes three implementations: one based on expat, another based

on ElementTree, and a final implementation based on lxml.  The order in which

they were listed is in order of compatibility.  The expat version should work on

all versions of Python > 2.0, the lxml one will work on all versions > 2.5, and

lxml will only work if you have lxml installed.

A Note on Installing lxml

-------------------------

While lxml is not a requirement, I have had a hard time installing lxml in the

past.  I have found this set of commands to work perfectly::

    STATIC_DEPS=true easy_install 'lxml>=2.2beta4'

    STATIC_DEPS=true sudo easy_install 'lxml>=2.2beta4'

API Documentation

-----------------

Accessing the Default Parser

============================

Simply import nzb_parser from the pynzb package.  It's an instantiated version

of the fastest available parser that your system can support.

Other Parser Locations

======================

``ExpatNZBParser``:

    Available in the ``pynzb.expat_nzb`` namespace.

``ETreeNZBParser``:

    Available in the ``pynzb.etree_nzb`` namespace.

``LXMLNZBParser``:

    Available in the ``pynzb.lxml_nzb`` namespace.

Using the NZB Parser

====================

If you're using a specific parser, like the ``ETreeNZBParser``, you will first

have to instantiate it::

    nzb_parser = ETreeNZBParser()

Otherwise, you can just import the default parser for your system::

    from pynzb import nzb_parser

Then, simply call the ``parse`` method, giving it the xml string as the only

argument::

    files = nzb_parser.parse('')

This will return a list of ``NZBFiles`` for you to use.

NZBFile Objects

===============

All of the parsers return ``NZBFile`` objects, which are objects with the

following properties:

``poster``:

    The name of the user who posted the file to the newsgroup.

``date``:

    A ``datetime.date`` representation of when the server first saw the file.

``subject``:

    The subject used when the user posted the file to the newsgroup.

``groups``:

    A list of strings representing the newsgroups in which this file may be

    found.

``segments``:

    A list of ``NZBSegment`` objects talking about where to get the contents

    of this file.

NZBSegment Objects

==================

Each ``NZBFile`` has a list of ``NZBSegment`` objects, which include information

on how to retrieve a part of a file.  Here's what you can find on an

``NZBSegment`` object:

``number``:

    The number of the segment in the list of files.

``bytes``:

    The size of the segment, in bytes.

``message_id``:

    The Message-ID of the segment (useful for retrieving the full contents)

Example

--------

In this example, we will grab an Ubuntu NZB and parse the file, printing out

some information about each file and its segments::

    from pynzb import nzb_parser

    from urllib2 import urlopen

    # Grab a sample Ubuntu NZB

    ubuntu_nzb = urlopen('http://media.eflorenzano.com/misc/sample-ubuntu-nzb.nzb').read()

    # Parse the NZB into files

    files = nzb_parser.parse(ubuntu_nzb)

    # Print out each file's subject and the first two segment message ids

    for nzb_file in files:

        print nzb_file.subject

        for segment in nzb_file.segments[:2]:

            print '    ' + segment.message_id

        if len(nzb_file.segments) > 2:

            print '    ...'

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/ericflo/pynzb

Awesome Lists containing this project

README