wx13/word_freq.sh

Created March 1, 2013 16:47

Star (0) You must be signed in to star a gist
Fork (0) You must be signed in to fork a gist

Learn more about clone URLs
Clone this repository at <script src="https://gist.github.com/wx13/5065970.js"></script>
Save wx13/5065970 to your computer and use it in GitHub Desktop.

Download ZIP

one-liner to get word frequencies from a file

Raw

word_freq.sh

strings $1 | sed 's/[^a-zA-Z0-9]/\n/g' | egrep '[a-zA-Z]{5}' | tr '[:upper:]' '[:lower:]' | awk '{++a[$1]}END{for(x in a){print a[x], x}}'

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment