{"id":18580415,"url":"https://github.com/gumblex/htmllisting-parser","last_synced_at":"2025-08-21T06:30:56.145Z","repository":{"id":52609775,"uuid":"75297325","full_name":"gumblex/htmllisting-parser","owner":"gumblex","description":"Python parser for Apache/nginx-style HTML directory listing","archived":false,"fork":false,"pushed_at":"2023-11-25T07:02:29.000Z","size":41,"stargazers_count":35,"open_issues_count":9,"forks_count":12,"subscribers_count":4,"default_branch":"master","last_synced_at":"2024-12-15T18:09:53.644Z","etag":null,"topics":["apache","fuse","nginx"],"latest_commit_sha":null,"homepage":null,"language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/gumblex.png","metadata":{"files":{"readme":"README.rst","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null,"governance":null,"roadmap":null,"authors":null,"dei":null,"publiccode":null,"codemeta":null}},"created_at":"2016-12-01T13:55:33.000Z","updated_at":"2024-09-18T09:40:02.000Z","dependencies_parsed_at":"2024-06-18T21:24:04.214Z","dependency_job_id":null,"html_url":"https://github.com/gumblex/htmllisting-parser","commit_stats":{"total_commits":29,"total_committers":5,"mean_commits":5.8,"dds":0.3448275862068966,"last_synced_commit":"acfdc723b9cd5b94de2df73eea9d1cf9a2cb0934"},"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gumblex%2Fhtmllisting-parser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gumblex%2Fhtmllisting-parser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gumblex%2Fhtmllisting-parser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/gumblex%2Fhtmllisting-parser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/gumblex","download_url":"https://codeload.github.com/gumblex/htmllisting-parser/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":230494921,"owners_count":18235046,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["apache","fuse","nginx"],"created_at":"2024-11-06T23:46:00.862Z","updated_at":"2024-12-19T20:08:23.708Z","avatar_url":"https://github.com/gumblex.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"htmllisting-parser\n==================\nPython parser for Apache/nginx-style HTML directory listing\n\n.. code-block:: python\n\n   import htmllistparse\n   cwd, listing = htmllistparse.fetch_listing(some_url, timeout=30)\n\n   # or you can get the url and make a BeautifulSoup yourself, then use\n   # cwd, listing = htmllistparse.parse(soup)\n\nwhere ``cwd`` is the current directory, ``listing`` is a list of ``FileEntry`` named tuples:\n\n* ``name``: File name, ``str``. Have a trailing / if it's a directory.\n* ``modified``: Last modification time, ``time.struct_time`` or ``None``. Timezone is not known.\n* ``size``: File size, ``int`` or ``None``. May be estimated from the prefix, such as \"K\", \"M\".\n* ``description``: File description, file type, or any other things found. ``str`` as HTML, or ``None``.\n\nSupports:\n\n* Vanilla Apache/nginx/lighttpd/darkhttpd autoindex\n* Most ``\u003cpre\u003e``-style index\n* Many other ``\u003ctable\u003e``-style index\n* ``\u003cul\u003e``-style\n\n.. note::\n   Please wrap the functions in a general ``try... except`` block. It may throw exceptions unexpectedly.\n\nReHTTPFS\n--------\n\nReinvented HTTP Filesystem.\n\n* Mounts most HTTP file listings with FUSE.\n* Gets directory tree and file stats with less overhead.\n* Supports Range requests.\n* Supports Keep-Alive.\n\n::\n\n   usage: rehttpfs.py [-h] [-o OPTIONS] [-t TIMEOUT] [-u USER_AGENT] [-v] [-d]\n                      url mountpoint\n\n   Mount HTML directory listings.\n\n   positional arguments:\n     url                   URL to mount\n     mountpoint            filesystem mount point\n\n   optional arguments:\n     -h, --help            show this help message and exit\n     -o OPTIONS            comma separated FUSE options\n     -t TIMEOUT, --timeout TIMEOUT\n                           HTTP request timeout\n     -u USER_AGENT, --user-agent USER_AGENT\n                           HTTP User-Agent\n     -v, --verbose         enable debug logging\n     -d, --daemon          run in background\n\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgumblex%2Fhtmllisting-parser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Fgumblex%2Fhtmllisting-parser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Fgumblex%2Fhtmllisting-parser/lists"}