apache-tuscany/sandbox/lresende/etc/download-script
lresende d9fd7c6afc Updating download scripts...
git-svn-id: http://svn.us.apache.org/repos/asf/tuscany@918120 13f79535-47bb-0310-9956-ffa450edef68
2010-03-02 17:30:52 +00:00
all-report.sh
cpp-report.sh
cpp_sca-m1.sh
cpp_sca-m2.sh
cpp_sca-m3.sh
cpp_sdo-m1.sh
cpp_sdo-m2.sh
cpp_sdo-m3.sh
das-beta1.sh
das-beta2.sh
das-m2.sh
das-report.sh
README
sca-11.sh
sca-12.sh
sca-13.sh
sca-14.sh
sca-15.sh
sca-16.sh
sca-090.sh
sca-091.sh
sca-099.sh
sca-100.sh
sca-101.sh
sca-121.sh
sca-131.sh
sca-132.sh
sca-151.sh
sca-m2.sh
sca-report.sh
sca-test.sh
sdo-1.sh
sdo-11.sh
sdo-111.sh
sdo-beta1.sh
sdo-m2.sh
sdo-report.sh

This is a set of scripts used to count release downloads from the Apache HTTP stat logs.

1. Where to download the HTTP stat logs from
The logs are located on people.apache.org under x1/logarchive/people/<year>/<month>/??.bz2
I usually download them into a folder structure like: <year>/<month>

2. Getting ready to run the scripts
Unzip the log files
   >bunzip2 ??.bz2
Concatenate all the logs from a given month into a single file
   >ls -1 | xargs -I {} -t cat {} >> ../<year>-<month>.log
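The concatenation step can also be sketched as a small shell function. This is only an illustration: concat_month is a hypothetical helper name, not one of the scripts in this directory, and it assumes the daily logs have already been unzipped into two-character file names (01, 02, ...) inside <year>/<month>.

```shell
#!/bin/sh
# Sketch only: concat_month is a hypothetical helper, not part of the original scripts.
# Assumes the daily logs were already unzipped and have two-character names (01, 02, ...).
concat_month() {
    dir=$1      # e.g. 2007/09
    out=$2      # e.g. 2007/2007-09.log
    : > "$out"                      # truncate first so a rerun does not append twice
    for f in "$dir"/??; do
        [ -f "$f" ] || continue     # skip when the glob matches nothing
        cat "$f" >> "$out"          # the glob expands in sorted order, so days stay in sequence
    done
}
```

For example, `concat_month 2007/09 2007/2007-09.log` would produce the monthly file that the report scripts take as input. Truncating the output first is a design choice that differs from the `>>` one-liner above, which keeps appending if it is run twice.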

3. Running the scripts
all-report.sh	- runs all the scripts
   >./all-report.sh

???-report.sh	- runs the reports for a given type of release/project (e.g. cpp, sca, sdo, das)
   >./sca-report.sh


4. Adding a new month to be processed
Edit the relevant ???-report.sh and add a new line for the new month as appropriate
   >./sca-099.sh 2007/2007-09.log >> sca-downloads.log
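As an alternative to adding one line per month by hand, a report script could loop over the monthly logs. The sketch below is an assumption about how that might look and is not how the existing ???-report.sh scripts are written. run_report is a hypothetical helper name.

```shell
#!/bin/sh
# Sketch only: run_report is a hypothetical helper; the real ???-report.sh
# scripts list one line per month and may be laid out differently.
run_report() {
    counter=$1      # per-release script, e.g. ./sca-099.sh
    out=$2          # report file, e.g. sca-downloads.log
    shift 2         # the remaining arguments are the monthly log files
    for log in "$@"; do
        [ -f "$log" ] || continue       # skip months that have not been downloaded yet
        "$counter" "$log" >> "$out"     # same call as the hand-written line, once per month
    done
}
```

A call such as `run_report ./sca-099.sh sca-downloads.log 2007/2007-*.log` would then pick up a new month as soon as its log file exists.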

5. Adding a new project/release
Create a new script that filters for the specific project/release
   >total=$(cat $1 | grep tuscany-das-1.0-incubating-beta1 | awk '{ print $2 }' | uniq | wc -l)
   >echo $1 $total

Then update the other scripts so that they call the new script for each month to be processed
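A new per-release script could be sketched along the following lines. count_downloads is a hypothetical function name, and the release name is passed as a parameter instead of being hard-coded. Two things are assumptions here: that field 2 of each log line identifies the downloader, as the existing awk command implies, and the use of `sort -u` in place of plain `uniq`. Plain `uniq` only collapses adjacent duplicate lines, so this variant can report lower totals than the existing scripts.

```shell
#!/bin/sh
# Sketch only: count_downloads is a hypothetical name, not one of the existing scripts.
# Assumes field 2 of each log line identifies the downloader, as the existing scripts imply.
count_downloads() {
    pattern=$1      # release file name fragment, e.g. tuscany-das-1.0-incubating-beta1
    log=$2          # monthly log, e.g. 2007/2007-09.log
    # grep -F matches the release name literally; sort -u drops repeated values of
    # field 2 even when they are not adjacent (a deliberate change from plain uniq).
    total=$(grep -F -- "$pattern" "$log" | awk '{ print $2 }' | sort -u | wc -l)
    echo "$log" $total      # $total left unquoted so any padding from wc is trimmed
}
```

For example, `count_downloads tuscany-das-1.0-incubating-beta1 2007/2007-09.log >> das-downloads.log` would append one line containing the log name and the count. The report file name in that example is also an assumption.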