©copyright Raimo Olavi Toivonen 19842019. All rights reserved. Last updated on August 8, 2019

Home  Signal displays 1985–  Multi-envelope displays 1998–  FFT displays 1985–  Cepstrum displays 1983−  LPC displays 1992–  Auditory displays 1983− Loudness displays 1995–  Loudness curves 1998–  Timbre spectrum 1983−  LTAS displays 1982–  Harmonic FFT displays 1989–  F0 displays 1983−  F0 histograms in semitone scale 1989−  Computer Voice Fields in Hz scale 1983−  Waterfall displays 1983−  FFT spectrogram displays 1985–  LPC spectrogram displays 1998–  Auditory spectrogram displays 1985–  Bark, ERB, mel, semitone and Hz scales Formant charts 1988−  Bark scale 1983−  ERB scale 1996−  Semitone scale 1989−  Other links  Formant charts of world languages in Bark scale 2018–

Works 1972−

ISA software

Jitter/shimmer in lin/Hz/dB scale on Intelligent Speech Analyser™ (ISA)

The fast F0 analysis I have developed is done by bandpass filtering, first by looking for a base period in the time signal using a unique method. Each basic cycle is determined with high accuracy and even small variations in the fundamental frequency can be seen.

The F0 algorithm I developed allows me to determine the base frequency jitter and shimmer.

The F0 algorithm I have developed has been in use for 34 years since 1984 and was running in real time on the NEC's uPD7720 signal processor as early as 1985, 33 years ago.

Analysis of images I have from the very beginning coded with Neon object-oriented programming language.

Who else in Finland, already 34 years ago, coded the baseline period from baseline analysis to baseline voice analysis and already in 1985 in real time?

F0 analysis, I as a DSP man coded from zero to different machine languages.

Jitter formula: 100 * scattering values ​​(f2-f1 / f1), f1 and f2 consecutive cycles.

Jitter formula: 100 * average of | (f2-f2 / f1) |, f1 and f2 consecutive periods.

Shimmer formula: averages values ​​of | a2-a1 |, a1 and a2 for max. amplitudes in dB as mourning.

Psychoacoustic links to "en.wikipedia.org/wiki":  Psychoacoustics   Auditory   Auditive   Auditory phonetics   Auditory system anatomy   Sone scale   Phon scale   Loudness   Equal-loudness contours   Fletcher–Munson curves   Bark scale   Critical bands   Auditory Filters   ERB scale (Equivalent rectangular bandwidth)   Mel scale   Semitone scale   Hertz scale   Sound pressure   SPL (Sound pressure level)   Stevens's power law   Stanley Smith Stevens   Harvey Fletcher   Karl Eberhard Zwicker

Jitter and shimmer curves without time in the Hz and dB scales.

Jitter histogram on a percentage/Hz scale.

Jitter Distribution 1.3.2001−. When the raw jitter curve is transformed into a distribution, completely new analytical images and statistical dimensions for the jitter are created. There are 251 values ​​in the jitter distribution from zero to increments of 0.1 Hz to 25 Hz. The key measure of the distribution is the width of the distribution, which is obtained by calculating the width of the distribution, which contains 90% of the periods. This percentage is selectable (eg 80%). The Hz width can also be normalized to a percentage, whereby 90% of the width is divided by the average base frequency. The resulting normalized percentage can be compared between men, women and children. The success of speech therapy is manifested by a decrease in the width of the jitter distribution, that is, a decrease in the Hz value and percentage value of the distribution width. Jitter distribution can also be successfully generated from literal text, since large jitter changes after word and byte borders remain at the tail of the distribution and disappear at 90% width.

Jitter-jakauma 1.3.2001−. Kun raakajitterkäyrä ja muutetaan jakaumaksi syntyy aivan uudenlaiset analyysikuvat ja tilastolliset mitat jitterille. Jitter-jakaumassa on arvoja nollasta lähtien 0.1 Hzn välein 25 Hzn saakka yhteensä 251 kpl. Jakauman keskeinen tunnusluku on jakauman leveys, joka saadaan laskemalla jakauman se leveys, johon sisältyy 90% jaksoista. Tämä prosenttiluku on valittavissa (esim. 80%). Hz-leveys on normalisoitavissa myös prosenteiksi, jolloin 90% leveys jaetaan keskimääräisellä perustaajuudella. Saatua normalisoitua prosenttiarvoa voidaan vertailla miesten, naisten ja lasten kesken. Puheterapian onnistuminen näkyy jitter-jakauman leveyden pienenemisenä eli jakauman leveyden Hz-arvon ja prosenttiarvon pienenemisenä. Jitter-jakauma voidaan menestyksellisesti tuottaa myös suorasanaisesta tekstistä, koska sanarajojen ja tavurajojen jälkeiset suuret jitter-muutokset jäävät jakauman häntään ja poistuvat 90% leveyden yhteydessä.

Shimmer distribution on a percentage / dB scale.

Shimmer distribution 1.3.2001−. Changing the raw dimmer curve to a distribution gives rise to a whole new set of analysis images and statistical dimensions for the shimmer. The Shimmer distribution has values ​​ranging from zero in 0.1 dB increments up to 25 dB, for a total of 251. The resolution of the distribution may be lower than 0.1 dB, for example 0.05 dB. The Shimmer distribution is a 90% width of the distribution. The value is in dB and is directly comparable between men, women and children. The success of speech therapy is manifested by a decrease in the width of the shimmer distribution, that is, a decrease in the dB and percentage of the width of the distribution. Shimmer distributions can also be successfully generated from literal text, since large shimmer changes after word and byte borders remain at the tail of the distribution and disappear at 90% width.

Shimmer-jakauma prosentti/dB-asteikolla.

Shimmer-jakauma 1.3.2001−. Kun raakashimmerkäyrä muutetaan jakaumaksi syntyy aivan uudenlaiset analyysikuvat ja tilastolliset mitat shimmerille. Shimmer-jakaumassa on arvoja nollasta lähtien 0.1 dBn välein 25 dB:hen saakka yhteensä 251 kpl. Jakauman resoluutio 0.1 dB:tä voi olla pienempikin vaikkapa 0.05 dB:tä. Shimmer-jakauman tunnusluku on jakauman 90% leveys. Arvo on dB:leinä ja verrattavissa suoraan miesten, naisten ja lasten kesken. Puheterapian onnistuminen näkyy shimmer-jakauman leveyden pienenemisenä eli jakauman leveyden dB-arvon ja prosentti-arvon pienenemisenä. Shimmer-jakauma voidaan menestyksellisesti tuottaa myös suorasanaisesta tekstistä, koska sanarajojen ja tavurajojen jälkeiset suuret shimmer-muutokset jäävät jakauman häntään ja poistuvat 90% leveyden yhteydessä.