Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/andikleen/weblog
simple shell oriented web log analysis tools
https://github.com/andikleen/weblog
Last synced: 11 days ago
JSON representation
simple shell oriented web log analysis tools
- Host: GitHub
- URL: https://github.com/andikleen/weblog
- Owner: andikleen
- Created: 2012-04-19T04:09:08.000Z (over 12 years ago)
- Default Branch: master
- Last Pushed: 2013-05-29T23:59:49.000Z (over 11 years ago)
- Last Synced: 2023-03-12T07:06:54.409Z (over 1 year ago)
- Language: Perl
- Size: 125 KB
- Stars: 3
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
![weblog] (http://halobates.de/weblog)
Simple web log analysis tools
Originally written long ago.
Written for apache, but should work with all web servers generating
standard logs. This is a light weight alternative to more complex
database based setups. The output is simple ASCII that
can be easily processed further in shell pipelines. They can be also
combined for more complex tasks.One big advantage is simplicity, so can be easily adapted for specific
purposes.It uses multiple passes through the logs and will likely not scale
to large logs. However it works quite well for moderate sized logs.searchterms may need occasional updates for the latest URLs generated
by search enginesTo use add the directory to your $PATH
export PATH=dir/webtools
or call weball with an absolute name to let it set the path.If you don't know how to use it just run weball
Dependencies: Perl-TimeDate
(yum install Perl-TimeDate or similar)Tools:
## weball log
run all analyses on a logsomewhat slow for larger logs, because it does many passes
N=NUM weball log
print NUM top entries## searchterms log
extract search engine search terms from a http log
referer logging needs to be enabled
options:
-n list numeric ips in front
-c add search engine domain name
-u print target
-U print URLs typed into search engine
-p print position in search engine results (or 0)## referer log
print referer from a httpd access log## removecrawl log
remove search machine crawls from a http logfile## robots log
display search engines## tophits log
print hits for pages## trails log
Identify individual visitors## useragents log
print user agents from a log## webbytes log
print total bytes transferred## accumulate [fieldnum]
accumulate on fieldnum field
default field 1For example to track countries of search engine
searchterms -c log | accumulate## weberrors log
print errors from a weblogTo get top errors
weberrors log | accumulate 2 | head## webresolveip log
resolve hosts in a weblog## webtimerange
print time range in a logAndi Kleen