Full Project – Design and implementation of text to speech application for vision impaired students
Click here to Get this Complete Project Chapter 1-5
CHAPTER ONE
- INTRODUCTION
As our society farther expands, there have been many supports for second class citizens, like the disabled. One of many supports that is urgent is the guarantee of mobility for blind people. There have been many efforts but even now, it is not easy for blind people to independently move. As electronic technologies have improved, a research about Electrical Aided: EA for blind people has started. With a current product, Human Tech of Japan developed Navigation for blind people, using GPS and cell phone. This system is consists of cell phone of the user (blind people), a subminiature of GPS receiver, a magnetic direction sensor, a control unit and speech synthesis equipment with PC of base station.
Text-To-Speech has been available for decades (since 1939). Unfortunately, quality of the output-especially in terms of naturalness-has historically been sub-optimal. Terms such as “robotic” have been used to describe synthetic speech. Recently, the overall quality of Text-To-Speech from some vendors has dramatically improved. Quality is now evident not only in the remarkable naturalness of inflection and intonation, but also in the ability to process text such as numbers, abbreviations and addresses in the appropriate context.
Text-to-speech (TTS) is a type of speech synthesis application that is used to create a spoken sound version of the text in a computer document, such as a help file or a Web page. TTS can enable the reading of computer display information for the visually challenged person, or may simply be used to augment the reading of a text message. Current TTS applications include voice-enabled e-mail and spoken prompts in voice response systems.
1.2 BACKGROUND OF THE STUDY
Long before electronic signal processing was invented, there were those who tried to build machines to create human speech. Some early legends of the existence of “Brazen Heads” involved Pope Silvester II (d. 1003 AD), Albertus Magnus (1198–1280), and Roger Bacon (1214–1294). In 1779, the Danish scientist Christian Kratzenstein, working at the Russian Academy of Sciences, built models of the human vocal tract that could produce the five long vowel sounds (in International Phonetic Alphabet notation, they are [aː], [eː], [iː], [oː] and [uː]). This was followed by the bellows-operated “acoustic-mechanical speech machine” by Wolfgang von Kempelen of Pressburg, Hungary, described in a 1791 paper. This machine added models of the tongue and lips, enabling it to produce consonants as well as vowels. According to Charles (1857), Wheatstone produced a “speaking machine” based on von Kempelen’s design, and in 1857, M. Faber built the “Euphonia”. Wheatstone’s design was resurrected in 1923 by Paget.
In the 1930s, Bell Labs developed the vocoder, which automatically analyzed speech into its fundamental tone and resonances. From his work on the vocoder, Homer Dudley developed a keyboard-operated voice synthesizer called The Voder (Voice Demonstrator), which he exhibited at the 1939 New York World’s Fair. The Pattern playback was built by Dr. Franklin S. Cooper and his colleagues at Haskins Laboratories in the late 1940s and completed in 1950. There were several different versions of this hardware device but only one currently survives. The machine converts pictures of the acoustic patterns of speech in the form of a spectrogram back into sound. Using this device, Allen J (2007) were able to discover acoustic cues for the perception of phonetic segments (consonants and vowels).
1.3 STATEMENT OF THE PROBLEM
The challenge that was picked up that lead to this piece of project work is that the blind find it difficult to know exactly the word they are typing even though they know the key board very well, still they just assume that they are correct. At the end of the day they will find themselves making a lot of mistakes in their typed works. This lead to the development of this project, Text to Speech Application.
1.4 OBJECTIVE OF THE STUDY
The main objective of this project is to create an application that will convert text to speech in order for the visually impaired student to know exactly what they are typing and inputing in the computer system. The visually impaired student will be well assured of what they are typing and know how to correct their mistake if any typographical error is their work.
1.5 SCOPE OF THE STUDY
The scope of this research work converts text into spoken word, by analyzing and processing the text using Natural Language Processing (NLP) and then using Digital Signal Processing (DSP) technology to convert this processed text into synthesized speech representation of the text.
1.6 SIGNIFICANCE OF THE STUDY
The significance of this project work is to serve as a helping tool for the vision impaired students, thereby, creating a text to speech synthesis application. The blind student will use the software to voice out what they have type.
1.7 LIMITATION OF THE STUDY
The limitations encounter in this research work includes:
- Limited time to carryout research on the subject. Not enough time to gather information for this research work.
- The epileptic nature of power supply in the country. After we have gather the little material – information for this work, there was shortage of power supply to organize our work.
- Another limitation is Finance: doing a research work definitely needs money. Shortage of funds is one of the greatest challenges we encountered during this project.
1.8 DEFINITION OF TERMS
Electrical Aided: An electronic device that help particular disable been to achieve a setting goal
GPS– Global Position System: Is a radio navigation system.
Phonetic: Relating to the sounds of spoken language
Receiver: Person or thing who receives something.
Robot: A machine built to carry out some complex task or group of tasks especially one which can be programmed.
Text: A writing consisting of multiple glyphs, characters, symbols or sentences.
Speech: the ability to speak or to use vocalization to communicate.
System: A collection of organized things
Get the Complete Project
This is a premium project material and the complete research project plus questionnaires and references can be gotten at an affordable rate of N3,000 for Nigerian clients and $8 for international clients.
Click here to Get this Complete Project Chapter 1-5
You can also check other Research Project here:
- Accounting Research Project
- Adult Education
- Agricultural Science
- Banking & Finance
- Biblical Theology & CRS
- Biblical Theology and CRS
- Biology Education
- Business Administration
- Computer Engineering Project
- Computer Science 2
- Criminology Research Project
- Early Childhood Education
- Economic Education
- Education Research Project
- Educational Administration and Planning Research Project
- English
- English Education
- Entrepreneurship
- Environmental Sciences Research Project
- Guidance and Counselling Research Project
- History Education
- Human Kinetics and Health Education
- Management
- Maritime and Transportation
- Marketing
- Marketing Research Project 2
- Mass Communication
- Mathematics Education
- Medical Biochemistry Project
- Organizational Behaviour
- Political Science
- Psychology
- Public Administration
- Public Health Research Project
- More Research Project
- Transportation Management
- Nursing
Full Project – Design and implementation of text to speech application for vision impaired students