Speech and Computer, Kartoniert / Broschiert
Speech and Computer
- 24th International Conference, SPECOM 2022, Gurugram, India, November 14-16, 2022, Proceedings
(soweit verfügbar beim Lieferanten)
- Herausgeber:
- S. R. Mahadeva Prasanna, Alexey Karpov, K. Samudravijaya, Shyam S. Agrawal
- Verlag:
- Springer, 11/2022
- Einband:
- Kartoniert / Broschiert, Paperback
- Sprache:
- Englisch
- ISBN-13:
- 9783031209796
- Artikelnummer:
- 11102385
- Umfang:
- 736 Seiten
- Nummer der Auflage:
- 22001
- Ausgabe:
- 1st edition 2022
- Gewicht:
- 1095 g
- Maße:
- 235 x 155 mm
- Stärke:
- 40 mm
- Erscheinungstermin:
- 13.11.2022
- Hinweis
-
Achtung: Artikel ist nicht in deutscher Sprache!
Klappentext
Thematic Diversity of Everyday Russian Discourse: a Case Study Based on the ORD corpus.- Neural Embedding Extractors for Text-Independent Speaker Verification.- Deep Speaker Embeddings based Online Diarization.- Overlapped Speech Detection Using AM-FM based Time-Frequency Representations.- Significance of Dimensionality Reduction in CNN-based Vowel Classification from Imagined Speech using Electroencephalogram Signals.- Study of Speech Recognition System Based on Transformer and Connectionist Temporal Classification Models for Low Resource Language.- An Initial Study on Birdsong Re-synthesis using Neural Vocoders.- Speech Music Overlap Detection using Spectral Peak Evolutions.- Influence of Accented Speech in Automatic Speech Recognition: A Case Study on Assamese L1 Speakers Speaking Code Switched Hindi-English.- ClusterVote: Automatic Summarization Dataset Construction with Document Clusters.- Comparing Unsupervised Detection Algorithms for Audio Adversarial Examples.- Celtic EnglishContinuum in Pitch Patterns of Spontane-ous Talk: Evidence of Long-Term Contacts .- Coherence Based Automatic Essay Scoring Using Sentence Embedding and Recurrent Neural Networks.- Analysis of Automatic Evaluation Metric on Low-Resourced Language: BERTScore Vs BLEU Score.- DyCoDa: A Multi-Modal Data Collection of Multi-User Remote Survival Game Recordings.- On the Use of Ensemble X-Vector Embeddings for Improved Sleepiness Detection.- Multiresolution Decomposition Analysis via Wavelet Transforms for Audio Deepfake Detection .- Automatic Rhythm and Speech Rate Analysis of Mising Spontaneous Speech.- An Electroglottographic Method for Assessing the Emotional State of the Speaker.- Significance of Distance on Pop Noise for Voice Liveness Detection .- CRIM's Speech Recognition System for OpenASR21 Evaluation with Conformer and Voice Activity Detector Embeddings.- Joint Changes in First and Second Formants of /a/, /i/, /u/ Vowels in Babble Noise - a New Statistical Approach.- Comparing NLPSolutions for the Disambiguation of French Heterophonic Homographs for End-to-End TTS Systems.- Detection of Speech Related Disorders by Pre-Trained Embedding Models Extracted Biomarkers.- Multi-Label Dysfluency Classification.- Harnessing Uncertainty - Multi-Label Dysfluency Classification with Uncertain Labels.- Continuous Wavelet Transform for Severity-Level Classification of Dysarthria.- Significance of Energy Features for Severity Classification of Dysarthria.- Sailor and Hemant A. Patil An Analytic Study on Clustering-based Pseudo-Labels for Self-Supervised Deep Speaker Verification.- Investigation of Transfer Learning for End-to-End Russian Speech Recognition.- Prosodic Features of Verbal Irony in Russian and French: Universal vs. Language-Specific.- Categorization of Threatening Speech Acts.- Assessment of Speech Quality During Speech Rehabilitation Based on the Solution of the Classification Problem.- Multi-level Fusion of Fisher Vector Encoded BERT and wav2vec 2.0 Embeddingsfor Native Language Identification.- Fake Speech Detection using OpenSMILE Features.- Nonverbal Constituents of Argumentative Discourse: Gesture and Prosody Interaction.- Classifying Mahout and Social Interactions of Asian Elephants based on Trumpet Calls.- Recognition of the Emotional State of Children with Down Syndrome by Video, Audio and Text Modalities: Human and Automatic.- Fake Speech Detection using Modulation Spectrogram.- Self-Configuring Genetic Programming Feature Generation in Affect Recognition Tasks.- A Multi[1]Modal Approach to Mining Intent from Code-Mixed Hindi-English Calls in the Hyperlocal-Delivery Domain.- Importance of Supra-Segmental Information and Self-Supervised Framework for Spoken Language.- Diarization Task.- Low-resource Emotional Speech Synthesis: Transfer Learning, Data requirements and Adversarial Training.- Fuzzy Classifier For Speech Assessment in Speech Rehabilitation.- Analysis-by-Synthesis Modeling of Bengali Intonation.- Neural Network Based Curve