Bob Oganovic

Composer, arranger,  engraver & transcriber, choral voices "synthesizer"

Choral Sound Synthesis

What is it? Why do it? What cost/effort is involved?

What is the goal here?

Create the sound of a human chorus singing notes and words – as realistically as possible – using computer hardware and software.

 

Why do it?

 To have recorded “performances” of my compositions to submit – along with the scores – for publication.

 

Why not just recruit a group of singers and record "live"?

I have more than once explored that option. There are two main reasons:

  1. COST – I calculated that for rehearsing and recording 8 to 10 anthems I needed 12 singers for 4 sessions (bare minimum, even with trained, sight-reading-savvy singers who received the music beforehand). So my conservative estimate was that (a) hiring 12 singers, an organist, and a sound engineer along with (b) renting equipment and a room to perform would cost over $4,000.
  2. COVID-19 – I hoped to accomplish this in early 2020, but with the pandemic essentially shutting down choral music activities any effort with “real” singers could not proceed.

 

So the steps involved are described below and the results are on your publisher's website. How difficult were these activities?

Extremely, by virtue of the fact that:

  • Although I was familiar with Sibelius and Microsoft Word the other products were new to me.
  • Even after becoming comfortable with the applications there was a huge challenge in getting multiple music-creation software applications – each with their own eccentricities – to play nicely together.
  • The documentation and online information available for this kind of multi-product integration were sparse, inconsistent, and sometimes just wrong. For example, some online sites suggested music score-writing software (e.g., Sibelius, Finale, Dorico) can integrate directly with choral soundset products like EastWest (described below). However, I found that not to be the case – hence the introduction of the digital audio workstation software (also described below).

Surely what you've done cannot replace the beauty and nuance of a human choir, right?

Correct, and that was never the intent. I am a singer myself – I get it! My goal was to create a serviceable replication of a performance by a chorus – a cappella or with instrumental accompaniment – that could be used for demo, audition, or publisher submission.

So you are available to consult with and/or do this work for other composers?

Absolutely! My experience in singing, composing, working with these applications, and creating websites – these are all skills I bring to the table. And I can do this at a fraction of the cost and much faster than the traditional approach.

Contact me – let’s talk!

 

Summary of the Steps Involved

Click on each numbered line below to jump to the details

1. Start with the necessary software running on a high-end computer

1. Start with the necessary software running on a high-end computer

Start with a PC at the high end in terms of storage and processing. You’ll be running several memory-intensive software applications, often at the same time. Here is my configuration*:

  1. Sibelius — for music score writing.
  2. EastWest Symphonic Choirs — for the human singer soundset and the companion software that comes bundled with it for purposes of creating lyrics through the Votox phonetic language.
  3. Studio One — the digital audio workstation (DAW) application that “hosts”, edits, and creates the synthesized sound.
  4. Garritan Classic Pipe Organs – used for the soundsets that replicate pipe organs (see step 9 below).
  5. Microsoft Word (or any word processing application) — to edit lyrics and convert them to Votox.

*NOTE:

  • All of the above items have competitor products, several of which I have used in the past. Having said that …
  • … Over time this is the workstation footprint I have settled on and which will be the basis for the descriptions that follow.
2. Review score for prospective challenges in MIDI & audio creation

2. Review score for prospective challenges in MIDI & audio creation

Applications like Sibelius are primarily designed for creating scores. Owing to this the score itself sometimes requires adjustment prior to generating MIDI file upon which the choral sound set of EastWest Symphonic Choirs will be applied. Points that may require additional attention include:

  • Breath marks
  • Slurs
  • Tempo change & fermatas
  • Dynamics
  • Missing/mis-aligned lyrics, especially around melismas
  • Pickup notes
  • Part names
3. Strip out (almost) everything from the score and export the MIDI files

3. Strip out (almost) everything from the score and export the MIDI files

For the MIDI file all you want are, basically, notes, rests and time signatures. Everything else will be added back later in the DAW (see step 8). For the MIDI export:

  • Set export for preferred metronome value.
  • Generate one MIDI part at a time.
  • Audit (listen to) the MIDI files individually and together with score in hand to assure all notes and rhythms exported correctly.
  • If errors are encountered revisit Sibelius, correct the problems, and re-generate the MIDI.
 4. Import the MIDI file into the digital audio workstation (DAW)

4. Import the MIDI file into the digital audio workstation (DAW)

  • Import one midi “track” or “line” (e.g., Soprano) at a time.
  • Be sure the time signature and metronome values are correct since DAW may not necessarily pick these up by default. Pickup notes require similar adjustments.
  • Assure part names are correctly assigned.
  • Adjust for eccentricities of EastWest, the DAW, or the MIDI file itself. For example, an issue in EastWest is that (a) the “Soprano” can sing no LOWER than D3 and (b) the “Bass” can sing no HIGHER than D3.
 5. Install EastWest Symphonic Choirs in the DAW for each choral part

5. Install EastWest Symphonic Choirs in the DAW for each choral part

  • Choose and load correct voices.
  • Set up basic configurations.
  • Choose language, phonemes, part names.
  • Test the “voices” both solo and tutti on a neutral syllable like “ahhhhh” to see if any errors occurred during the earlier MIDI export/import cycle and to get a sense of the overall balance within the DAW.
6. Convert lyric text to Votox phonetics

6. Convert lyric text to Votox phonetics

This step (6) and the the following one (step 7are the most critical AND time-consuming of tasks. At a high level the roadmap to convert the choral text from English, Latin, or whatever into Votox* syllables is:

  • Export text from Sibelius to cover ALL the words for the piece. This is typically 1 or 2 parts (or portions of several parts) depending upon such things as the mix of homophony/polyphony and which parts are tacet during particular stretches.
  • In Microsoft Word edit the exported text and convert the text to Votox for one part.
  • Add critical measure numbers from the score.
  • Bring converted Votox text into EastWest Symphonic Choirs.
  • Test the notes with Votox on one part and adjust as necessary.

*NOTE: Votox was developed specifically for use with the EWSC software. It bears some similarity to – but is NOT based on – IPA (or any other phonetic mappings).

 7. Replicate prior steps for other parts

7. Replicate prior steps for other parts

If the Votox conversion in step 6 is successful it can serve as the model for the other parts. Differences is rhythms, melismas, rests etc. will require adjustments across parts, and for that reason it is best to work on one part at a time.

Other “gotcha’s” in Votox are similar to things singers always need to be vigilant about:

  • Diphthongs
  • Final consonants, including shadow vowels and the nasal (“American”) final “R” sound
  • Text attack – normal vs. legato vs. staccato

Much of the success (or otherwise) here is a result of trial and error. For example:

  • Problem: “K” or hard “G” sounds are coming across too forcefully on a particular note when all 4 voices are singing.
  • (Possible) solution: Have just Alto and Bass sing those consonants while Soprano and Tenor go right to the ensuing vowel sound.
8. Convert MIDI tracks to audio and "edit back in" dynamics, expressions, tempo changes, fermatas et al

8. Convert MIDI tracks to audio and "edit back in" dynamics, expressions, tempo changes, fermatas et al

Once the choral lyrics and notes are aligned you move on to converting the MIDI parts to audio parts* (e.g., WAV, .MP3). This is where dynamics, expressions, tempo changes, fermatas etc. return to the piece. Working with both the original Sibelius score and the DAW convert each part from MIDI to audio and then:

  • Adjust volume levels (dynamics) first across all parts and then …
  • Between parts in measures.
  • Add fade-in (-out) on selected measures.
  • Adjust panning.
  • Add reverberation – i.e., more echo (“wetter”) or less (“drier”)
  • Adjust tempo across all parts.
  • Produce final ouptut in .WAV, .MP3 or whatever audio format – this is uaually referred to as a “mixdown”.

*For a good “101-type” discussion of MIDI vs. audio see this article.

9. Add organ (or other instrumental) accompaniment

9. Add organ (or other instrumental) accompaniment

Unlike the sounds of singers and their words, Sibelius and others have long been able to replicate instrument soundsets with good quality and relative ease. However, replicating pipe organ sounds requires some special considerations. Again, at a high level here are the steps involved:

  • The MIDI output is still generated from Sibelius and imported into DAW but …
  • … The soundset is now a product like Garritan rather than EastWest.
  • Coming from Sibelius it is easiest to think in terms of the initial MIDI file being Organ 1 (mostly right hand/treble clef), Organ 2 (mostly left hand/bass clef), and Pedal.
  • In the DAW each organ stop for each part (Organ 1, Organ 2, Pedal) requries its own track, much like a score for band or orchestra.

So, on the last point, in a simplified example:

  • If Organ 1 has 5 stops throughout its entire part it will have 5 tracks.
  • If Organ 2 is coupled to those exact same 5 stops it will have its own 5 tracks.
  • Finally, assume Pedal has a couple of unique stops throughout the score.
  • The net/net is that there will be 12 (5 + 5 +2) tracks altogether.

Like the choral tracks the organ lines require the same drill:

  • Listen to (audit) the MIDI lines separately and then together for accuracy.
  • When squared away in MIDI convert and collapse the tracks to audio.
  • Edit for the usual dynamics etc.
  • Mix down for the final version.

Contact Me (*required)

14 + 3 =