Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/vkuznet/cmswebanalytics
A project to perform CMS web analytics.
https://github.com/vkuznet/cmswebanalytics
Last synced: 30 days ago
JSON representation
A project to perform CMS web analytics.
- Host: GitHub
- URL: https://github.com/vkuznet/cmswebanalytics
- Owner: vkuznet
- Created: 2017-12-17T18:03:08.000Z (about 7 years ago)
- Default Branch: master
- Last Pushed: 2018-01-02T18:39:37.000Z (about 7 years ago)
- Last Synced: 2024-10-30T06:27:34.944Z (3 months ago)
- Size: 4.63 MB
- Stars: 0
- Watchers: 3
- Forks: 0
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
README
# CMSWebAnalytics
A project to perform CMS web analytics.### Introduction
CMS collects various logs from data-services deployed on cmsweb cluster.
They are DBS, DAS, SiteDB, Phedex, ReqMgr2, etc.The original [code](https://github.com/dmwm/analytics) was written in Python
and deployed on slc5_amd64_gcc461 architecture. It parsed log files available
on vocms022 and produce series of dataframes stored either in json, txt or
sqlitedb. These data are availabel in /afs/cern.ch/cms/cmsweb/analytics
area. The final set of data was feed into web interface where access to
data-services was displayed as following![access1](https://github.com/vkuznet/CMSWebAnalytics/blob/master/images/plot1.png)
![access2](https://github.com/vkuznet/CMSWebAnalytics/blob/master/images/plot2.png)### Tasks
We may either try to use existing [code](https://github.com/dmwm/analytics) and
setup it elsewhere. Or, we can write very simple python go Go-based parser
and produce JSON files which later can be fed into CERN MONIT system.- write Go-based parser to parse apache log files. Here is an example of apache
entry:
```
[12/Dec/2017:00:59:59 +0100] cmsweb.cern.ch xxxx:yyyy:zzz:a8::100:21a "GET /couchdb/reqmgr_workload_cache/pdmvserv_task_B2G -RunIISummer16DR80Premix-02387__v1_T_171015_165132_2461 HTTP/1.1" 200 [data: 3358 in 26751 out 7823 body 197194 us ] [auth: TLSv1.2 DHE-RSA-AES256-GCM-SHA384 "/DC=YY/DC=BLA/OU=FOO/CN=bla" "-" ] [ref: "-" "WMCore.Services.Requests/v001" ]
```
- create output DataFrames (in CSV and JSON) formats with the following content:
- date,service,request_count
- date,service,DN,request_count
the later DataFrame will be used to identify most active users
- use go-stomp package to send data to CERN MONIT system, see examples
github.com/thomasmodeneis/go-amq-poc and github.com/go-stomp/stomp