Sound is an important aspect of our daily lives, from the music we listen to, the sounds of nature, and even the spoken words we hear. Sound can evoke emotions and affect our mood and behavior. In recent years, advances in technology have made it possible to create more immersive and interactive sound experiences, such as surround sound systems and 3d data visualisation.
In this article, we will explore the concept of Emotional XYZ soundwave, a 3-dimensional sound wave that resonates to voice pitch on the vertical scale and emotions on the depth scale. This innovative technology provides a more immersive and emotionally engaging sound experience, with the ability to create an emotional soundscape that interacts with the listener’s voice and emotions.
The concept of 3D sound has been around for some time, with the goal of creating an immersive sound experience that goes beyond traditional stereo sound. 3D sound technology aims to create a spatial audio experience, where sound is perceived to come from different directions and distances, providing a more realistic and natural listening experience. The Emotional XYZ soundwave builds on this concept by adding emotional depth to the sound, creating a more engaging and interactive experience.
The Emotional XYZ Sound Wave project aims to create a three-dimensional sound wave visualisation that resonates to the emotional content of speech. The sound wave is positioned in three dimensions: the horizontal position corresponds to the left-right panning of the sound, the vertical position corresponds to the pitch of the speaker’s voice, and the depth position corresponds to the emotional content of the speech, with positive emotions positioned in front and negative emotions positioned in the back.
To create the emotional sound wave, we use Voice-coil actuators and Piezoelectric transducers to vibrate analog and digital devices, respectively, in a way that corresponds to the emotional content of the speech. The resulting vibration is captured by a microphone and converted into digital data using an analog-to-digital converter (ADC). We can then process the digital data using techniques such as speech analysis to extract features that correspond to the emotional content of the speech.
To control the positioning of the sound wave, we use an oscillator node and a stereo panner node in a Web Audio API implementation. The panner node is used to control the horizontal position of the sound wave, while the vertical and depth positions are mapped to the oscillator frequency and gain, respectively, using a map function. The frequency and gain values are updated in real-time based on the emotional content of the speech.
Voice-coil actuators: Translating Emotional Content into Vibrations
Voice-coil actuators are devices that can convert electrical signals into mechanical vibrations. These devices are commonly used in speakers, headphones, and other audio devices to create sound waves. However, they can also be used to transmit emotional content through vibrations. By modulating the amplitude and frequency of the vibrations based on the speaker’s emotional state, voice-coil actuators can provide tactile feedback that corresponds to the emotional content of the speech.
Piezoelectric Transducers: Converting Emotions into Tactile Feedback
Piezoelectric transducers are devices that can convert electrical signals into mechanical vibrations and vice versa. They are commonly used in a variety of applications, such as in ultrasonic sensors, loudspeakers, and musical instruments. However, they can also be used to transmit emotional content through vibrations. By attaching piezoelectric transducers to digital devices, such as smartphones or computers, emotional content can be transmitted through vibrations that are felt by the user.
One application of piezoelectric transducers for transmitting emotional content through vibrations is in the development of digital devices, such as smartphones or smartwatches. These devices can be equipped with piezoelectric transducers that are programmed to vibrate in response to the emotional content of the speech. For example, if the speaker is conveying a positive emotion, such as happiness, the device can vibrate with a high frequency, creating a sensation of joy or excitement. Conversely, if the speaker is conveying a negative emotion, such as sadness, the device can vibrate with a lower frequency, creating a sensation of melancholy or sorrow.
Speech Analysis
One of the most widely used techniques for recognizing emotions on a frequency scale is speech analysis. Emotions are associated with variations in the pitch, tone, and speed of speech. For example, when a person is happy, their voice tends to have a higher pitch and is more energetic. When a person is sad, their voice tends to be lower in pitch and slower in tempo. Speech analysis can be used to measure these variations and map them to different frequencies. In this way, the depth of human emotional state can be recognized on a frequency scale.
Electroencephalography (EEG)
EEG is a technique that records the electrical activity of the brain. This technique can be used to recognize emotions by analyzing the frequencies of the brain waves. Different emotions are associated with different frequencies of brain waves. For example, alpha waves (8-13 Hz) are associated with relaxed states, while beta waves (14-30 Hz) are associated with more active states. By analyzing the frequency of the brain waves, the depth of human emotional state can be recognized on a frequency scale.
Heart Rate Variability (HRV) Analysis
HRV analysis is a technique that measures the variation in time intervals between heartbeats. HRV has been found to be associated with emotional states. For example, when a person is stressed, their heart rate tends to be more regular and less variable, while during relaxation, the heart rate tends to be more variable. By analyzing the frequency of these variations in heart rate, the depth of human emotional state can be recognized on a frequency scale.
Conclusion
The Emotional XYZ soundwave project is an innovative approach to creating a more immersive and emotionally engaging sound experience. By using Voice-coil actuators and Piezoelectric transducers to transmit emotional content through vibrations, the project offers a novel way to create a tactile and emotional soundscape. The project’s use of speech analysis, EEG, and HRV analysis to recognize the depth of human emotional state on a frequency scale further highlights the potential applications, such as in virtual reality and gaming to enhance the immersive experience, in therapy and counseling to help individuals recognize and regulate their emotions, and in speech recognition systems to improve accuracy and usability. The project is a novel and exciting exploration of the intersection between technology and emotional expression.
See Emotional XYZ soundwave Github repository
emotional-xyz-soundwave (this link opens in a new window) by guillaumelauzier (this link opens in a new window)
The code computes the STFT of the audio signal using overlapping windows, and various features can be extracted from the resulting STFT matrix, which can then be used to train a machine learning model, such as an SVM, to classify the emotional state of the speaker.
References
- Moheimani SR, Fleming AJ. Piezoelectric transducers for vibration control and damping. London: Springer; 2006 Jun 29.
- Daniel J, James H M. Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition. prentice hall; 2007.
- Luck SJ. An introduction to the event-related potential technique. MIT press; 2014 Jun 20.
- Trimmel K, Sacha J, Huikuri HV, editors. Heart rate variability: Clinical applications and interaction between HRV and heart rate. Frontiers Media SA; 2015 Oct 7.