Zounds is a python library for building audio feature extraction pipelines in a delcarative way, from re-usable building blocks. It provides many of the primitives you’ll need to start experimenting with audio, like Short-Time fourier and discrete cosine transforms, chroma, and bark-frequency cepstral coefficients.

My goal is to make installing zounds as painless as possible, but as of this writing, there are a few small hurdles you’ll need to clear before you can pip install zounds.

Numpy and Scipy

Zounds depends on numpy and scipy. If you don’t already have them installed (if you do any kind of scientific or numerical computing with python, you surely do), I highly recommend Continuum Analytics’ Anaconda distribution. This will get you up and running much more quickly than building numpy and scipy from source yourself.

`libsndfile 1.0.26`

Zounds depends on the most recent release of libsndfile, which, as of this writing, is not yet available from Ubuntu’s default apt sources. To get this latest version of libsndfile installed prior to installing zounds, there’s a bash script in the zounds git repository you can run.

Installing From PyPI

Once you’ve got all the dependencies out of the way, your best bet will be to simply pip install zounds.

Installing From Source

Of course, if you need to be on the bleeding edge, you can clone the repository from github, or download the zipped source and run python setup.py install from the source directory.

Zounds Repl

For quick experiments, zounds provides a zounds-quickstart script, which you can run from anywhere after installing the package. This will start an interactive repl that runs in your browser (any modern browser with WebAudio support will do). The repl should behave just like the one you’re accustomed to running from the console, but it also allows you to playback audio and view spectrograms, for a richer experience. To start the server, just run:

zounds-quickstart --datadir data --port 9999

Then, in your browser, try the following:

>>> _id = Document.process(meta='http://phatdrumloops.com/audio/wav/youregettn.wav')
>>> doc = Document(_id)

To listen to the original audio, type…

>>> doc.ogg # this should cause an html5 audio element to appear

…and you should see an html5 audio element that will play the original audio, like this:

To see the sound’s spectral centroid over time, type…

>>> doc.centroid # this should cause a graph of spectral centroid to appear

…and you should see an image like this: Spectral Centroid

Then, let’s try sorting the audio frames by spectral centroid, ascending:

>>> import numpy as np
>>> indices = np.argsort(doc.centroid)
>>> synth.synthesize(doc.dct[indices]) # a new html5 audio element should appear.  listen to it!

At this point, you shoud see an html5 audio element that will play the sound, ordered by spectral centroid, ascending