{"id":18906723,"url":"https://github.com/oduwsdl/accesslog-parser","last_synced_at":"2025-10-16T01:07:28.303Z","repository":{"id":57407883,"uuid":"211740760","full_name":"oduwsdl/accesslog-parser","owner":"oduwsdl","description":"Web server access log parser and CLI tool with added features for web archive replay logs","archived":false,"fork":false,"pushed_at":"2019-10-12T22:38:44.000Z","size":49,"stargazers_count":1,"open_issues_count":0,"forks_count":0,"subscribers_count":6,"default_branch":"master","last_synced_at":"2025-02-10T05:06:07.935Z","etag":null,"topics":["access-logs","accesslog-parser","cli","memento","package","parser","python3"],"latest_commit_sha":null,"homepage":"","language":"Python","has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":"mit","status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/oduwsdl.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":"LICENSE","code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2019-09-29T23:44:58.000Z","updated_at":"2020-11-01T15:28:25.000Z","dependencies_parsed_at":"2022-09-26T16:30:57.816Z","dependency_job_id":null,"html_url":"https://github.com/oduwsdl/accesslog-parser","commit_stats":null,"previous_names":["oduwsdl/archive-accesslog"],"tags_count":5,"template":false,"template_full_name":null,"repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oduwsdl%2Faccesslog-parser","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oduwsdl%2Faccesslog-parser/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oduwsdl%2Faccesslog-parser/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/oduwsdl%2Faccesslog-parser/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/oduwsdl","download_url":"https://codeload.github.com/oduwsdl/accesslog-parser/tar.gz/refs/heads/master","host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":239890484,"owners_count":19713964,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2022-07-04T15:15:14.044Z","host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["access-logs","accesslog-parser","cli","memento","package","parser","python3"],"created_at":"2024-11-08T09:18:30.767Z","updated_at":"2025-10-16T01:07:23.266Z","avatar_url":"https://github.com/oduwsdl.png","language":"Python","funding_links":[],"categories":[],"sub_categories":[],"readme":"# AccessLog Parser and CLI\n\nWeb server access log parser and CLI tool with added features for web archive replay logs.\n\n## Installation\n\nThis package requires Python 3.6 or above.\nInstall the latest version of the released package and `accesslog` CLI tool from PyPi:\n\n```\n$ pip install accesslog\n```\n\nAlternatively, build and install the development version of the package:\n\n```\n$ git clone https://github.com/oduwsdl/accesslog-parser.git\n$ cd accesslog-parser\n$ python3 setup.py install\n$ accesslog --version\n```\n\n## Input Parsing\n\nTODO\n\n## Record Filtering\n\nTODO\n\n## Output Formatting\n\nTODO\n\n## CLI Reference\n\n```\n$ accesslog -h\nusage: accesslog [options] [FILES ...]\n\nA tool to parse Common Log formatted access logs with various derived fields.\n\npositional arguments:\n  files                 Log files (plain/gz/bz2) to parse (reads from the STDIN, if empty or '-')\n\noptional arguments:\n  -h, --help            Show this help message and exit\n  -v, --version         Show version number and exit\n  -d, --debug           Show debug messages on STDERR\n  -e FIELDS, --nonempty FIELDS\n                        Skip record if any of the provided fields is empty (comma separated list)\n  -i FIELDS, --valid FIELDS\n                        Skip record if any of the provided field values are invalid\n                        ('all' or comma separated list from 'host,request,status,size,referrer')\n  -m FIELD~RegExp, --match FIELD~RegExp\n                        Skip record if field does not match the RegExp (can be used multiple times)\n  -t TFORMAT, --origtime TFORMAT\n                        Original datetime format of logs (default: '%d/%b/%Y:%H:%M:%S %z')\n  -f FORMAT, --format FORMAT\n                        Output format string (see available formatting fields below)\n  -j FIELDS, --json FIELDS\n                        Output NDJSON with the provided fields (use 'all' for all fields except 'origline')\n\nformatting fields:\n  {origline}            Original log line\n  {host}                IP address of the client\n  {identity}            Identity of the client, usually '-'\n  {user}                User ID for authentication, usually '-'\n  {origtime}            Original date and time (typically in '%d/%b/%Y:%H:%M:%S %z' format)\n  {epoch}               Seconds from the Unix epoch (derived from origtime)\n  {date}                UTC date in '%Y-%m-%d' format (derived from origtime)\n  {time}                UTC time in '%H:%M:%S' format (derived from origtime)\n  {datetime}            14 digit datetime in '%Y%m%d%H%M%S' format (derived from origtime)\n  {request}             Original HTTP request line\n  {method}              HTTP method (empty for invalid request)\n  {path}                Path and query (scheme and host removed, empty for invalid request)\n  {prefix}              Memento endpoint path prefix (derived from path)\n  {mtime}               14 digit Memento datetime (derived from path)\n  {rflag}               Memento rewrite flag (derived from path)\n  {urir}                Memento URI-R (derived from path)\n  {httpv}               HTTP version (empty for invalid request)\n  {status}              Returned status code\n  {size}                Number of bytes returned\n  {referrer}            Referer header (empty, if not logged)\n  {agent}               User-agent header (empty, if not logged)\n  {extras}              Any additional logged fields\nDefault FORMAT: '{host} {date} {time} {method} {path} {status} {size} \"{referrer}\" \"{agent}\"'\n```\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foduwsdl%2Faccesslog-parser","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Foduwsdl%2Faccesslog-parser","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Foduwsdl%2Faccesslog-parser/lists"}