This page surveys the numerous freely-downloadable software programs for conducting acoustic analyses of speech recordings.

Highlighted resources: Audacity (under 1st section), Praat, and Wavesurfer (both under 2nd section) are the most popular.

For online resources where you can learn more about acoustics, see the Acoustic Phonetics section on the Web Portal page.


+

Programs for handling recordings

-

Programs for handling recordings

Audacity

  • Overview: Audacity is a free, open source, cross-platform software for recording and editing sounds. For more information, see the Audacity homepage and the Wikipedia page.
  • Download: Audacity can be downloaded from the project's SourceForge page
  • Documentation: See Audacity's online Manual and Wiki. You can also take a workshop on Audacity from Indiana University's IT Training department (or download the workshop's files and teach yourself).
  • Support: See the help page on the Audacity site for the various available resources, including a forum and mailing lists.

Miscellaneous


+

Software with a graphical user interface

-

Software with a graphical user interface

For information on the commercial program "Adobe Audition" (formerly Cool Edit Pro), see the download page for Creative Suite 6 Production Premium on IUware (Windows, Mac) and the workshops offered through IT Training.

Note that a comparison of the different software programs listed below is available from Peter Roach (University of Reading).

Praat

WaveSurfer

EMU Speech Database System

Speech Filing System

Speech Analyzer


+

Packages for scripting languages

-

Packages for scripting languages

R

  • Overview: R is free software programming language and software environment for statistical computing and graphics. Further details can be found at the R homepage and its Wikipedia page. See the Statistics page for ways to learn R at IU.
  • Official releases: R has several packages that are useful for phonetics:
    • phontools: ("Functions for phonetics in R")
    • phonR: ("R tools for phoneticians and phonologists")
    • vowels: ("Vowel manipulation, normalization, and plotting")
    • tuneR: ("Analysis of music and speech")
    • signal: ("Signal processing")
    • seewave: ("Sound analysis and synthesis")
    • audio: ("Audio interface for R")
  • Interfaces: R can control many of the other phonetics software programs described on this page:

Python

MATLAB / GNU Octave


+

Libraries for advanced programming

-

Libraries for advanced programming

Snack Sound Toolkit

Entropic Signal Processing System (ESPS)

  • History: Originally a commercial product of Entropic Research Laboratory, Inc. (commonly abbreviated 'Entropic'), the following bundle of software was extremely popular in phonetics labs around the world in the 1990s:
    1. Entropic Signal Processing System (ESPS): a collection of over 200 UNIX commands and programming libraries (written in C) for speech signal processing
    2. waves+: a set of functions for interactive data visualization and manipulation of data processed by ESPS. The program that forms the core of the waves+ package is known as 'xwaves'.
    3. EnSig: A higher-level graphical user interface for setting waves+ preferences and behavior with interactive prompts

    After Entropic was acquired by Microsoft in 1999, the rights to the final legacy version of the ESPS source code (i.e. #1 above, but not #2 or #3) was donated to the Center for Speech Technology (Centrum för TalTeknologi, CTT) in the Department of Speech, Music, and Hearing (Tal musik och hörsel, TMH) at the Royal Institute of Technology (Kungliga Tekniska högskolan, KTH) in Stockholm, Sweden. This code was incorporated into the Snack Sound Toolkit. (See above.)

  • Download: The ESPS source code can be obtained in any the following ways:
  • Documentation: The full manual is available from Roberto Togneri (University of Western Australia).

Edinburgh Speech Tools Library

Speech Signal Processing Toolkit

  • Overview: The Speech Signal Processing Toolkit (SPTK) is a set of speech signal processing tools for Unix environments.
  • Creators: SPTK working group at Tokyo Institute of Technology and Nagoya Institute of Technology (See the readme.)
  • Download: SPTK can be downloaded from the project's SourceForge page.
  • Documentation: A link to the SPTK manual can be found on the SPTK home page.

Advanced Speech Signal Processor

  • Overview: The Advanced Speech Signal Processor (ASSP) library provides functionality for handling speech signal files - both as a command-line tool by itself ('libassp') and with a Tcl extension ('tclassp').
  • Creators: Lasse Bombien (Ludwig Maximilian University of Munich) and Michel Scheffers (Christian Albert University of Kiel)
  • Download: Both libassp and tclassp can be downloaded from the project's SourceForge page.
  • Documentation: A manual is available for tclassp.
  • Support: Bugs and feature requests can be submitted on the project's SourceForge page. See also the mailing list.