📞 +91-7667918914 | ✉️ iarjset@gmail.com
International Advanced Research Journal in Science, Engineering and Technology
International Advanced Research Journal in Science, Engineering and Technology A Monthly Peer-Reviewed Multidisciplinary Journal
ISSN Online 2393-8021ISSN Print 2394-1588Since 2014
IARJSET aligns to the suggestive parameters by the latest University Grants Commission (UGC) for peer-reviewed journals, committed to promoting research excellence, ethical publishing practices, and a global scholarly impact.
← Back to VOLUME 12, ISSUE 5, MAY 2025

Feedback Mechanism on Public Speaking using Audio and Video Analysis

Siddaraj M G, Abrar Khan, Ankith Gowda B H, Daivik R, P G Nithin

👁 3 views📥 0 downloads
Share: 𝕏 f in

Abstract: This project introduces an innovative real-time feedback software aimed at enhancing public speaking skills through comprehensive analysis of webcam data. The system evaluates key aspects of body language such as posture, gestures, and eye contact, along with critical speech metrics including filler word usage, speaking pace, and clarity. By delivering instant, actionable feedback and detailed progress reports, it enables users to systematically improve their presentation skills. The software is built using Streamlit for a responsive user interface and backend, a Convolutional Neural Network (CNN) for analyzing non-verbal communication, Hugging Face models for advanced natural language processing, and Librosa for audio analysis and transcription. Trained on a diverse dataset of annotated public speaking videos, the system ensures high accuracy and relevance while maintaining strict privacy and ethical standards. Extensive testing has validated its reliability, and continuous updates based on user feedback allow the software to evolve with technological advancements and user needs. This AI-powered tool represents a significant step forward in making high-quality public speaking training accessible to all.

Keywords: The main keywords of the project are public speaking, real-time feedback, body language, speech analysis, CNN, Hugging Face, Librosa, NLP, audio-visual processing, feature extraction, user interface, Streamlit, Tkinter, machine learning, deep learning, emotion detection, posture, gestures, eye contact, and filler word

How to Cite:

[1] Siddaraj M G, Abrar Khan, Ankith Gowda B H, Daivik R, P G Nithin, “Feedback Mechanism on Public Speaking using Audio and Video Analysis,” International Advanced Research Journal in Science, Engineering and Technology (IARJSET), DOI: 10.17148/IARJSET.2025.125258

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.