Marsyas has been used for a variety of projects in both academia and industry:

HEARBO - Hearing Robot

Marsyas was integrated with HARK (HRI-JP Audition for Robots with Kyoto University) Robot Audition System for music processing. It has already been used by HRI-JP, in collaboration with INESC Porto and LIACC-FEUP, for the development of an interactive robot dancing system HEARBO, where a humanoid robot simultaneously responds to the beat of music and to the speech of a human interactor in a real-world scenario. HARK-Marsyas could be the baseline of music listening robots in the future.

Genetic Programming for Sound Synthesizer Evolution

Simon Fraser University, School of Interactive Arts and Technology:

Marsyas is used to design and prototype a sound similarity function that acts as fitness function in our system. This function leads our algorithm in the space of all possible synthesizers to find the most relevant architecture and parameters to produce a given target sound. The Marsyas Python wrappers have been successfully used in our distributed implementation.

Yahoo Research

Yahoo Research and the Yahoo Media Group have been using Marsyas to analyze songs in our 2-million song database. We are interested in both how the songs are related to each other, so we can find similar songs, and how they differ, so we can characterize people's musical interests. Marsyas provides us with an easy-to-use platform that computes many of the features we think are useful. But more importantly, Marsyas is a common platform for acoustic signal processing, so we can report results that are easy for other research labs to replicate. Malcolm Slaney

ORBIT Project (BBC Research)

ORBIT - Object Re-configurable Broadcast Infrastructure Trial, a contract pilot project between the BBC RD ( and INESC Porto ( development of automatic audio segmentation and classification tools using Marsyas (September 2001 - September 2002).

VISNET I & II EU FP6 NoE Projects

VISNET I - II - NETworked audio VISual media technologies (FP6-2002-IST-1 and FP6-2005-IST-41 European Union Network of Excellence Projects, respectively) Marsyas is used at Audio and Music analysis work packages by some of the partners (December 2003 - June 2009).


INESC Porto - Institute for Systems and Computer Engineering of Porto - is a private non-profit association, recognized as Public Interest Institution, that has been recently appointed as Associated Laboratory. It is located in Porto, Portugal. At the Multimedia and Telecommunications Unit (UTM), INESC Porto researchers have used and contributed to Marsyas development in audio, video and multimodal analysis and processing.


Marsyas was used to design and develop an in-house tool for gender classification (male/female/silence) for voice messages. The system achieves classification of approximately 90% and is running on 10 hubs processing about 25000 recordings (1-2 minutes each) per day. The project was initiated by Paul Snider with consulting by George Tzanetakis.


Marsyas was used to design and prototype the audio fingerprinting technology used to link user files to metadata and fix ID3 tags by the Moodlogic client. The fingerprint is small (about 300 bytes/file), is fast to compute, and matching is performed to a database of 1.5 million songs)

MusicEmo - Eric Yang, NTU, Taiwan

We identified and analyze three critical issues of music emotion recognition the subjectivity of emotion perception, the ambiguity of categorical emotion models, and the semantic gap between low-level audio signals to high-level emotion. Some results of the previous two issues have been published. To see more detail, please see the website. Marsyas is used in our system for feature extraction. Our experiments show the strength of the features in Marsyas.


Musicream is a novel music playback interface that lets users unexpectedly come across musical pieces they like. It facilitates active, flexible, and unexpected browsing. For example, the similarity-based sticking function enables user to easily pick out and listen to similar pieces from a streaming flow of music. Marsyas is used to automatically extract a single feature vector that characterizes the content of a particular music piece. That vector is used for color visualization of the audio content as well as to support the similarity-based sticking function. Marsyas provided an easy way to extract audio content information and enabled us to concentrate on designing and developing the user interface.Masataka GotoSenior Research Scientist, AIST, Japan

Musie Mood

Our goal is to develop an integrated system to visualize and query large music libraries. The layout is controlled by the user and similarity of songs is measured in perceptual terms. We use Marsyas to extract structural features which have perceptual interpretation (e.g. tempo, loudness, beat strength, etc...).- Vladimir G. Kim, Steven Bergner, Torsten Muller. GrUVi lab, Simon Fraser Univeristy, Canada.

Orelia - Sound Source Recognition

Orelia is using Marsyas as a calculation engine in his Sound Source Recognition Software (OSSR). OSSR automatically recognize noise sources like aircraft noise, railway noise, road traffic noise etc. The product is used by acousticians to perform environmental noise assessment, complementing the sound pressure level. Marsyas provides fast calculation and helps OSSR to process large amounts of audiofiles in a very resonable time - Boris Defreville and Remi Poittevin.

Multi-Label classification of music

In our project, the automatic detection of emotion in music was modeled as a multi-label classification task. Marsyas was used for the extraction of rhythmic and timbre features on a new collection of 593 songs. We compared the predictive performance of four multi-label classification algorithms. Furthermore, the predictive power of each feature was evaluated using a new multi-label feature selection method. Konstantinos Trohidis - Grigorios Tsoumakas

Dancing Robot with Marsyas

Marsyas is being used as the rhythmic interface beyond dancing robots, under a PhD project at LIACC - Artificial Intelligence and Computer Science Laboratory, and INESC Porto; which began in 2008. This research focus on multidisciplinary aspects of interactive music and dancing robotic systems, and its applications, being mainly founded on the interconnection of music, rhythm, perception, emotion, movement, and interaction in an expression of dance, as a form of art and sonification. Till date we developed a Lego-NXT-based robot, which uses Marsyas to analyze low-level aspects of rhythm, through onset detection, embodying the resultant rhythmic events with user-defined dance movements. I would like to express my gratitude to the MARSYAS' comunity for making this possible. - Joao Lobato Oliveira, PhD student at FEUP, Porto, Portugal.

Modeling Emotional Content of Music

ASTA - Automatic Subtitle Timing Annotator

Subtitling a video/song is a tedious task, not only one has to write the subtitle, but also one has to specify its timing (start and end times). ASTA project tries to automatically determine the subtitles timing based on the, possibly polyphonic, audio input. We found Marsyas to be the most suitable tool for both signal processing (i.e. feature extraction) and machine learning (i.e. training and classification). Beside its efficiency, it provides such a complete solution to audio-analysis that we didn't need any other library. We thank Marsyas team for open-sourcing such a great project. Mohamed Abdel Maksoud (


sndpeek is just what it sounds (and looks) like * real-time 3D animated display/playback * can use mic-input or wav/aiff/snd/raw/mat file (with playback) * time-domain waveform * FFT magnitude spectrum * 3D waterfall plot * lissajous! (interchannel correlation) * rotatable and scalable display * freeze frame! (for didactic purposes) * real-time spectral feature extraction (centroid, rms, flux, rolloff) * available on MacOS X, Linux, and Windows under GPL authors Ge Wang | Perry Cook | Ananya Misra | George Tzanetakis (MARSYAS) date 2003 - present

Analysis of Audio Features in Broadcast Sports Video

Multimedia Lab (Ghent University - IBBT) has been using Marsyas for the analysis of audio features in broadcast sports video. These audio features are used to detect semantically meaningful audio segments (e.g., cheering of the audience, commentary, whistles). This allows extracting specific events from the sports video that are useful for different types of applications (sports summarization and highlighting). - Chris Poppe, Multimedia Lab, Ghent University - IBBT, Belgium.


Vivi is a computer program which performs music at approximately the level of a student with one year of practice. Why write a computer program that can simulate an inexperienced musician playing a low-quality instrument, when I have an excellent-quality cello and decent viola? Well, in 70 years Ill be over 100. Barring miraculous advances in medicine, I wont be able to exert enough force with my right hand, my left hand wont be able to move fast enough, my reactions wont be fast enough to adjust my actions as necessary. In short, I wont be able to play cello. Musical creativity is hindered by physical constraints.