Marsyas has been used for a variety of projects in both academia and industry.
Yahoo Research
ORBIT Project (BBC Research)
VISNET I & II EU FP6 NoE Projects
INESC Porto
Teligence
Moodlogic
last.fm
MusicEmo - Eric Yang, NTU, Taiwan
Musicream
Musie Mood
Orelia - Sound Source Recognition
Multi-Label classification of music
Dancing Robot with Marsyas
Personalized Multimodal Music Search
Modeling Emotional Content of Music
ASTA - Automatic Subtitle Timing Annotator
SndPeek
Analysis of Audio Features in Broadcast Sports Video
vivi
Yahoo Research
Yahoo Research and the Yahoo Media Group have been using Marsyas to analyze songs in our 2-million song database. We are interested in both how the songs are related to each other, so we can find similar songs, and how they differ, so we can characterize people's musical interests. Marsyas provides us with an easy-to-use platform that computes many of the features we think are useful. But more importantly, Marsyas is a common platform for acoustic signal processing, so we can report results that are easy for other research labs to replicate. -
Malcolm Slaney
http://www.musiclibre.org/research/
ORBIT Project (BBC Research)
ORBIT - “Object Re-configurable Broadcast Infrastructure Trial”, a contract pilot project between the BBC R&D (http://www.bbc.co.uk/rd/index.shtml) and INESC Porto (http://www.inescporto.pt): development of automatic audio segmentation and classification tools using Marsyas (September 2001 ~ September 2002).
http://www.bbc.co.uk/orbit/index.shtml
VISNET I & II EU FP6 NoE Projects
VISNET I & II - “NETworked audio VISual media technologies” (FP6-2002-IST-1 and FP6-2005-IST-41 European Union Network of Excellence Projects, respectively): Marsyas is used at Audio and Music analysis work packages by some of the partners (December 2003 ~ June 2009).
http://www.visnet-noe.org/
INESC Porto
INESC Porto - Institute for Systems and Computer Engineering of Porto - is a private non-profit association, recognized as Public Interest Institution, that has been recently appointed as Associated Laboratory. It is located in Porto, Portugal. At the Multimedia and Telecommunications Unit (UTM), INESC Porto researchers have used and contributed to Marsyas development in audio, video and multimodal analysis and processing. More info can be found at:
http://www.inescporto.pt/~lmartins/
http://telecom.inescporto.pt/~lfpt/main/pmwiki.php?n=Main.MarsyasX
http://www.inescporto.pt
Teligence
Marsyas was used to design and develop an in-house tool for gender classification (male/female/silence) for voice messages. The system achieves classification of approximately 90% and is running on 10 hubs processing about 25000 recordings (1-2 minutes each) per day. The project was initiated by Paul Snider with consulting by George Tzanetakis.
http://www.teligence.net/
Moodlogic
Marsyas was used to design and prototype the audio fingerprinting technology used to link user files to metadata and fix ID3 tags by the Moodlogic client. The fingerprint is small (about 300 bytes/file), is fast to compute, and matching is performed to a database of 1.5 million songs)
http://www.moodlogic.com/
last.fm
"At last.fm we used Marsyas to design and validate prototypes, and to
quickly test ideas. The latest version dramatically improved its quality
and performance - if you are doing serious MIR your should definitely
give it a try!" - Norman Casagrande, Head of Music Research
http://www.last.fm
MusicEmo - Eric Yang, NTU, Taiwan
We identified and analyze three critical issues of music emotion recognition:
the subjectivity of emotion perception, the ambiguity of categorical emotion
models, and the semantic gap between low-level audio signals to high-level
emotion. Some results of the previous two issues have been published. To see
more detail, please see the website. Marsyas is used in our system for feature extraction. Our experiments show the strength of the features in Marsyas.
http://mpac.ee.ntu.edu.tw/~yihsuan/
Musicream
Musicream is a novel music playback interface that lets users
unexpectedly come across musical pieces they like. It facilitates active,
flexible, and unexpected browsing. For example, the "similarity-based"
sticking function enables user to easily pick out and listen to similar
pieces from a streaming "flow" of music. Marsyas is used to
automatically extract a single feature vector that characterizes the
content of a particular music piece. That vector is used for color
visualization of the audio content as well as to support the
"similarity-based" sticking function. Marsyas provided an easy way to
extract audio content information and enabled us to concentrate on
designing and developing the user interface.
Masataka Goto
Senior Research Scientist, AIST, Japan
http://staff.aist.go.jp/m.goto/Musicream/
Musie Mood
our goal is to develop an integrated system to visualize and query large music libraries. The layout is controlled by the user and similarity of songs is measured in perceptual terms. We use Marsyas to extract structural features which have perceptual interpretation (e.g. tempo, loudness, beat strength, etc...).
- Vladimir G. Kim, Steven Bergner, Torsten Möller.
GrUVi lab, Simon Fraser Univeristy, Canada.
http://gruvi.cs.sfu.ca/researchProject.php?s=373
Orelia - Sound Source Recognition
Orelia (
http://www.orelia.fr) is using Marsyas as a calculation engine in his Sound Source Recognition Software (OSSR). OSSR automatically recognize noise sources like aircraft noise, railway noise, road traffic noise etc. The product is used by acousticians to perform environmental noise assessment, complementing the sound pressure level.
Marsyas provides fast calculation and helps OSSR to process large amounts of audiofiles in a very resonable time - Boris Defréville and Rémi Poittevin.
http://www.orelia.fr
Multi-Label classification of music
In our project, the automatic detection of emotion in music was modeled as a multi-label classification task. Marsyas was used for the extraction of rhythmic and timbre features on a new collection of 593 songs. We compared the predictive performance of four multi-label classification algorithms. Furthermore, the predictive power of each feature was evaluated using a new multi-label feature selection method.
Konstantinos Trohidis - Grigorios Tsoumakas
http://mlkd.csd.auth.gr/multilabel.html
Dancing Robot with Marsyas
Marsyas is being used as the rhythmic interface beyond dancing robots, under a PhD project at LIACC - Artificial Intelligence and Computer Science Laboratory, and INESC Porto; which began in 2008. This research focus on multidisciplinary aspects of interactive music and dancing robotic systems, and its applications, being mainly founded on the interconnection of music, rhythm, perception, emotion, movement, and interaction in an expression of dance, as a form of art and sonification. Till date we developed a Lego-NXT-based robot, which uses Marsyas to analyze low-level aspects of rhythm, through onset detection, embodying the resultant rhythmic events with user-defined dance movements. I would like to express my gratitude to the MARSYAS' comunity for making this possible. - João Lobato Oliveira, PhD student at FEUP, Porto, Portugal.
http://paginas.fe.up.pt/~ee03123/
Personalized Multimodal Music Search
Marsyas has been helping a lot in our music search prototype called
"Personalized Multimodal Music Search", built in Dr Wang Ye's group at
School of Computing, National University of Singapore. We mainly use
Marsyas for music classification using audio signals. More
specifically, we constructed our own classification networks using
Marsyas modules for genre, mood, instrument, and vocalness
classifications. The class activation probabilities in the
classification results were used as audio signatures to represent
different music dimensions (namely, genre, mood, instrument and
vocalness). The music search prototype is publicly accessible from
the link below . Besides searching music by its content, the search engine also
provides music search by keywords. In addition, the system allows users to
personalize different music dimensions to do their search by keyword or
example (only mp3 examples can be recognized for now) .
For more details of the system, please refer to the SIGIR'09 paper titled
"CompositeMap: A Novel Framework for Music Similarity Measure". We really
benefited a lot from Marsyas framework in implementing the audio processing
module of our system. We thank all the contributors of Marsyas for their great efforts.
- Bingun (Eddie) Zhang, Ye Wang, National University of Singapore
http://mir.comp.nus.edu.sg
Modeling Emotional Content of Music
In this project, the emotional content of music was modeled as a function of music features. The model's inputs were time varying features; many features were extracted using Marsyas. The model's output was a 2-dimensional time varying signal that quanitified emotion. The emotion signal was generated by volunteers who evaluated the emotional content of several musical selections and models were created using system identification techniques.
M. D Korhonen
Systems Design Engineering, University of Waterloo, Ontario, Canada, 2004
http://www.sauna.org/kiulu/emotion.html
ASTA - Automatic Subtitle Timing Annotator
Subtitling a video/song is a tedious task, not only one has to write the subtitle, but also one has to specify its timing (start and end times). ASTA project tries to automatically determine the subtitles timing based on the, possibly polyphonic, audio input. We found Marsyas to be the most suitable tool for both signal processing (i.e. feature extraction) and machine learning (i.e. training and classification). Beside its efficiency, it provides such a complete solution to audio-analysis that we didn't need any other library. We thank Marsyas team for open-sourcing such a great project. Mohamed Abdel Maksoud (http://rw4.cs.uni-sb.de/people/mohamed.shtml).
http://sourceforge.net/projects/asta-annotator/
SndPeek
sndpeek is just what it sounds (and looks) like:
* real-time 3D animated display/playback
* can use mic-input or wav/aiff/snd/raw/mat file (with playback)
* time-domain waveform
* FFT magnitude spectrum
* 3D waterfall plot
* lissajous! (interchannel correlation)
* rotatable and scalable display
* freeze frame! (for didactic purposes)
* real-time spectral feature extraction (centroid, rms, flux, rolloff)
* available on MacOS X, Linux, and Windows under GPL
authors: Ge Wang | Perry Cook | Ananya Misra | George Tzanetakis (MARSYAS)
date: 2003 - present
http://soundlab.cs.princeton.edu/software/sndpeek/
Analysis of Audio Features in Broadcast Sports Video
Multimedia Lab (Ghent University - IBBT) has been using Marsyas for the
analysis of audio features in broadcast sports video.
These audio features are used to detect semantically meaningful audio
segments (e.g., cheering of the audience, commentary, whistles). This allows
extracting specific events from the sports video that are useful for
different types of applications (sports summarization and highlighting).
- Chris Poppe, Multimedia Lab, Ghent University - IBBT, Belgium.
http://multimedialab.elis.ugent.be/
vivi
Vivi is a computer program which performs music at approximately the level of a student with one year of practice.
Why write a computer program that can simulate an inexperienced musician playing a low-quality instrument, when I have an excellent-quality cello and decent viola?
Well, in 70 years I’ll be over 100. Barring miraculous advances in medicine, I won’t be able to exert enough force with my right hand, my left hand won’t be able to move fast enough, my reactions won’t be fast enough to adjust my actions as necessary. In short, I won’t be able to play cello.
Musical creativity is hindered by physical constraints.
http://percival-music.ca/vivi.html