This looks for data in a directory with many files, selects 1000 of them randomly, gets their sizes, and then uses mymaths.py to compute the mean, variance and stdev and other properties
du -k $(find /directory/with/many/files -type f) | \
shuf -n 1000 | \
pick-columns.sh 1 | \
mymaths.py mean variance stdev multimode min maxOutput:
mean: 21062.96
variance: 5526661943.796364
stdev: 74341.52233978239
multimode: 4.0
min: 4.0
max: 583948.0