Austrian Composer Peter Ablinger has transformed a child speaking so that it can be played as MIDI events on a mechanically-controlled piano, making the piano a kind of speech speaker. Via Matrixsynth, the readers at Hack a Day get fairly involved with how this may be working.

It seems not quite accurate to describe this as vocoding in the strictest sense, so much as a simple transformation to a (much) lower frequency resolution – that is, the 88 keys of the piano. Ablinger, for his part, describes the events as “pixels.” It’s pretty extraordinary that, without a bandpass filter, you get something approximating the noisy sibilance of the speech, but this seems to be the result of having lots of events (that is, lots of resolution in terms of time).

Edit: Listening again, the short answer to how you can hear so much of the voice through the piano seems to be that you can’t; the original is almost certainly mixed in. It’s nonetheless an interesting effect, and I’d like to hear the piano on its own.

In other words, the basic process is: 1) convert the sound spectrum of the recorded voice to a series of MIDI events, and 2) play back the translated MIDI file. You can see that the MIDI playback is accomplished with Pd (Pure Data) running on a Linux/KDE netbook, though it’s not clear what was used to do the original conversion. (The screenshot with side-by-side audio and MIDI appears as though it may be for demonstration purposes only.)
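To make step 1 a little more concrete, here is a rough Python sketch of the general idea: measure how much energy each short slice of audio has at the pitch of each piano key, and emit a note "pixel" for every key that is loud enough. This is emphatically not Ablinger's patch. The Goertzel algorithm is my own stand-in for whatever spectral analysis he uses, and the function names, `frame_size`, and `threshold` are hypothetical choices for illustration only.

```python
import math

A4_HZ = 440.0        # concert pitch, MIDI note 69
LOWEST_KEY = 21      # MIDI note A0, lowest key on an 88-key piano
HIGHEST_KEY = 108    # MIDI note C8, highest key


def key_frequency(note):
    """Equal-tempered frequency in Hz for a MIDI note number."""
    return A4_HZ * 2.0 ** ((note - 69) / 12.0)


def goertzel_magnitude(frame, freq_hz, sample_rate):
    """Magnitude of a single frequency component of a frame (Goertzel)."""
    coeff = 2.0 * math.cos(2.0 * math.pi * freq_hz / sample_rate)
    s_prev, s_prev2 = 0.0, 0.0
    for x in frame:
        s = x + coeff * s_prev - s_prev2
        s_prev2, s_prev = s_prev, s
    power = s_prev ** 2 + s_prev2 ** 2 - coeff * s_prev * s_prev2
    return math.sqrt(max(power, 0.0))


def frames_to_events(samples, sample_rate, frame_size=1024, threshold=0.1):
    """Return (time_sec, midi_note, velocity) tuples, one per loud key per frame.

    frame_size and threshold are illustrative assumptions, not values
    taken from the original work.
    """
    events = []
    for start in range(0, len(samples) - frame_size + 1, frame_size):
        frame = samples[start:start + frame_size]
        for note in range(LOWEST_KEY, HIGHEST_KEY + 1):
            freq = key_frequency(note)
            if freq >= sample_rate / 2:
                continue  # above Nyquist, cannot be measured
            # Scale so a full-strength sine at this pitch reads roughly 1.0.
            level = goertzel_magnitude(frame, freq, sample_rate) / (frame_size / 2)
            if level >= threshold:
                velocity = min(127, max(1, round(level * 127)))
                events.append((start / sample_rate, note, velocity))
    return events


if __name__ == "__main__":
    # Demo input: a plain 440 Hz tone instead of a recorded voice.
    rate = 8000
    tone = [0.8 * math.sin(2 * math.pi * 440.0 * n / rate) for n in range(2048)]
    for event in frames_to_events(tone, rate):
        print(event)
```

Shorter frames give more events per second (more resolution in time) at the cost of blurrier pitch, which is the trade-off the "pixel" metaphor hints at. Step 2, writing these tuples out as a MIDI file or sending them to a player piano, is left out here.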

Correction: The work is absolutely done in custom software developed by the composer in Pd (Pure Data). It’s an ideal tool for the job, and free and open source. I wouldn’t dare try to replicate the results here, but this is fantastic inspiration for playing with sound in Pd.

One Windows tool that’s capable of the job is TS-AudioToMIDI, as observed by Hack a Day reader spacecoyote. Whether or not that’s what’s at work here – and it may well be – that utility is itself interesting. Edit: Yeah, far more likely the whole thing was done in Pd. And Pd should be up to the task.

TS-AudioToMIDI

Of course, this is to say nothing of the lovely work done on the mechanical piano. It’s a beautiful piece. Here’s hoping some government bureaucrats got the message of the declaration. Now, we just need a chorus of something really loud – say a thousand trumpets – shouting out the Universal Declaration of Human Rights.


55 Responses

  1. Dub

    Gorgeous! Just gorgeous.

  2. Bjorn Westergard

    I would have never guessed that a piano's frequency specificity would be sufficient for this kind of thing!

    It would be cool if he wrote a piece that transitioned gradually from something more recognizably "music" to "speech".

  3. Bjorn Westergard

    I also wonder if he deliberately chose a kid with sibilant heavy speech.

  4. vcd

    Interesting piece, but I'm not so sure you would be able to tell it was based on speech if the original audio were not played on top of it, or if the transcription were not being shown to read in time. The work done with the piano is pretty stunning, though.

    Leave it to Peter to turn something borderline sensational into something completely sensationalized (re: last paragraph).

  5. shamburglar

    anybody know of any decent audio to midi apps for Mac?

  6. Adrian Anders

    TS-AudioToMIDI dev should invest the time to make a VST plug-in version of his software. I would be interested in it then.

  7. Dano

    @shamburglar

    Similar thing for Mac:
    http://widisoft.com/english/mp3-midi-products.htm…

  8. Jay Smith

    I use WIDI for mac. I made a "player piano" video a while back for the ohm64 using it and show how it is done here http://www.youtube.com/watch?v=KkKESe_QdKE

  9. nick kent

    Well, he's encoding in a process seemingly similar to a vocoder, but decoding in a non-traditional way.

    If you think he's mixing the original with the piano, then he's definitely cheating. It doesn't look realtime to me, but perhaps that might be a reason why it would be somewhat excusable to hear the original.

    If it is not cheating, I think it is very impressive. A traditional vocoder adjusts each frequency band's volume continuously. A piano just has a velocity, a short attack, and a long decay that can be dampened, and that's besides the rich harmonic pitched sound vs. bandpass decoding.

  10. KULTURTECHNO

    Quadraturen…

    As a follow-up to the Palin song, here is a just-published little YouTube film about Peter Ablinger's Quadraturen, in which a mechanically played piano imitates speech.

    (via)

    ……

  11. kobe

    one word: Melodyne.

  12. [...] Exactly how this works, and why Peter Ablinger does it, is explained well enough in the film. (Direct link, via Create Digital Music) [...]

  13. Ivica Bukvic

    Looks to me that the desktop was actually running Linux/KDE with Pd, rather than Windows. Also, the person at the computer looks an awful lot like Winfried Ritsch from IEM, where they do a lot of work with Pd. So, it seems unlikely that the Windows app in question is being used here and more likely that the whole thing is done in Pd…

  14. Peter Kirn

    @Ivica: You're completely right. That is indeed very clearly KDE. And there's a big honking "X" in the other window. ;) So, yes, I agree, and I should get back to learning more Pd signal processing kung fu.

  15. Stij

    Wow. I've often wondered if something like this was possible, but I've never had any idea of how to implement it. If this is legit then it's very impressive.

  16. jens-oliver

    Here's a piano only video http://vimeo.com/1483630. Not the same text and with additional notes. Very amazing.

  17. Stij

    Hmm…yeah, it isn't nearly as intelligible without the original voice mixed in, but you can still hear some of the sibilants.

    It also sounds extremely creepy!

  18. GMM

    Wow this is amazing. And it is only a piano. Imagine when you have a whole orchestra scored and conducted to reproduce speech, and then further on, a whole orchestra running in realtime as a vocoder!

  19. Peter Kirn

    Here we go – here's the full explanation of how the whole thing works, including a blurry image of the Pd patch.

    http://ablinger.mur.at/docu11.html

    I must say, I love the idea of pixelation – this is something that, as a general approach, could be attached to a wide variety of work.

    Oh, and I actually prefer the more abstract rendition minus the overlaid speech. Who needs intelligibility? It's gorgeous.

  20. Fishboy

    Why are so many commenters focusing on the sibilants? What makes them more interesting than other phonemes/classes of phonemes?

  21. Peter Kirn

    I'm not a linguist, but sibilants are essential to understandability, and they're the thing that would theoretically be hardest to hear on a piano which is least able to produce broad-band noise (versus formants/vowels). If you listen to the piano without the voice, in fact, it's what seems to be largely missing.

  22. Fishboy

    So are you saying you hear vowels in the video without the actual voice layered in? http://vimeo.com/1483630 I couldn't hear a voice in that one, myself, at least not well enough to make out any words or phonemes – vowels, sibilants, or otherwise. I guess to my ear it sounded vaguely vocal. But anyway, I thought the most interesting would be vowels, especially diphthongs, since the language used is English.

  23. John

    I'm not clear, how is this concept of "pixels" really any different than that of wavelets?

    As an aside, I'm not wholly convinced that they *are* mixing in the original audio on the feature video. Upon hearing the kid's plain voice, his formant seems different than what is coming from the piano audio. Is there anything other than subjective listening which would indicate that they are mixing in the original audio? The Vimeo clip Fishboy links to is difficult to compare, simply because of the vastly different acoustics, different piano AFAIK, and it doesn't seem to have the dampening that the one above does.

    Interesting work regardless of this point.

  24. Dub

    Also covered by MeFi

  25. [...] by Martin Poulter on 8 October 2009 An amazing hardware hacking project: a mechanical piano, computer-controlled, becomes a speech [...]

  26. [...] kraftfuttermischwerk & createdigitalmusic] [...]

  27. [...] # Create Digital Music » The Speaking Piano, and Transforming Audio to MIDI [...]

  48. [...] I don't want to withhold from you here. After I already stumbled across a speaking piano yesterday, this staircase piano is really the [...]

  49. [...] Speaking piano [...]

  50. Jhhl

    Speaking orchestra? http://www.heraldscotland.com/speakings-a-new-mus…
    Harvey has done excellent work for decades.

    To blow my own horn: my Amiga program RGS is a real-time spectrogram paint program (from 1987 originally), which could send out spectra as MIDI information, making it possible for my (microtonally tuned) DX7 to emit intelligible and unintelligible speech. http://www.echo.net/~jhhl/Mp3/RGS/

  51. Pianoman

    Intelliscore is another program that converts audio to MIDI. It works with the latest versions of Windows, including Vista and Windows 7. The website is: http://www.intelliscore.net/

  52. [...] was really excited by Peter Ablinger’s Speaking Piano–a system that takes human speech and translates it to a sequence of notes to be played on a piano [...]

  53. Sylvaiw

    Why do you say the original voice is mixed with the piano? Where did you get this information? I can't find it.

    In my opinion, only the piano is heard, and that's the whole interest of this thing.

  54. telfer cronos

    i'm sure you are right, sylvia.
