PDF to mp3 converter

PDF to mp3 converter

Overview

The PDF to MP3 converter converts text documents in PDF format into recorded speech. The text to speech conversion will enable blind people to have easier access to electronic material. Other groups of people with learning or reading disabilities will have an easier time processing the electronic information.
The project has two parts: a complete implementation and background research into which disabled groups, i.e. blind and learning disabled, can benefit from electronic text readers.
Progress Report

The current demo I have automatically converts a pdf file into mp3 format. I edited some HTML and manually converted WAV to MP3 although in the future this will be automated.

The pdf file is my Fall 02 IP
*magic converter*
Zipped Mp3 files (20 MB) (disabled for now, need more disk space)
PPT Presentation

Implementation

The detailed technical description of is project is:
PDF --> text --> text-to-speech --> audio --> saved audio file.

There is existing software for each stage that I would integrate. Adobe has an Acrobat SDK, which can extract text from a PDF. The text can be converted to speech with the Microsoft text-to-speech software or maybe even Jaws. The generated speech can be played immediately or saved to file (WAV or Mp3 format). The different components can be combined using Microsoft tools, such as Visual Basic or Visual Studio C++.
PDF 2 text
An Adobe SDK provides a programmable interface to the Adobe Acrobat Reader and Abode Acrobat (the full version). The SDK should provide access to the text of the PDF. There are other solutions on the web to access text from pdf. Here are some solutions.

Adobe online pdf--> HTML converter

Adobe accessibility

Acrobat Reader 5.1 converts to text

CZ-Pdf2Txt Simple for acrobat reader V1.1 (terrible)

PDF-to-HTML.com

PDFtohtml with source code

Text-to-speech
Microsoft provides free text-to-speech software. I imagine possible enhancements to simply reading the text with one voice. The voice could change from a man's voice to a woman's voice when reading titles, italics, or other outstanding text.

Microsoft research text-to-speech

Emacspeak

For Win32 using ViaVoice

Save speech to audio file
The MS text-to-speech generates the audio and can also store the audio to a WAV file. I am sure code exists to convert the WAV to mp3. Once in Mp3, I could imagine downloading the audio to an MP3 player and listening to the PDF paper while walking outside, riding the bus, or in the car.

MP3 encoder (Lame)

Not Lame

Changing speed of audio files

Disability study

The second element to this project is to investigate the significance of a PDF reader for those who are disabled. Obviously, blind people will be able to benefit. Sighted, however, who have learning disabilities can also benefit. Chapter 6 of the class text book describes how text-to-speech is beneficial for the learning disabled. I also plan to interview the experts at the university learning disability center to find the benefits of text-to-speech software.

www.cast.org

American foundation for the blind

UNC Learning Disabilities Services

LDonline

Technology

Learning Disabilities Association of America

Speech recog. for people with disabilities

Ability Hub

Technology overview

Landmark college

Paper: Object-oriented Collaborative Course Authoring Environment supported by Concept Mapping in MyEnglishTeacher (2001)

Adaptive Authoring of Adaptive Educational Hypermedia

IEEE Learning Technology Standards Committee (LTSC)

Writing organization

Inspiration Inc.

Review

Prodict

Ashley Software

Product: Writer's Block

The Learning Company (down server?)

Student Writing center

KidWorks Deluxe

http://www.davd.com/ (server down?)

DraftBuilder

research

Writers Helper

Writing for LD

Using Technology to Enhance the Writing Processes of Students with Learning Disabilities

More papers

Ldonline papers on writing

Journal of Learning disabilities

Learning Disabilities Research & Practice

Kluwer online

Journal of Educational Psychology (Educational Psychology Review)

Reading and Writing Quarterly

Prompting software

Prompting to write stories

Great: Software to Support Writing Instruction

PREWRITING PROMPTS

A Comparative Study of the First-Generation Invention Software

1983 :: Shaping the Next Generation of Educational Word Processing

Indirections

Reading

SQ3R - A READING/STUDY SYSTEM

Active listening (by RFBD)

Digital Talking books

daisy consortium

National Information Standard organization (NISO)

Other organizations

Learn NC

Learning center

http://members.aol.com/markr13/FearMongers5.html fear of changing culture. It is the unknown that makes it difficult. People can live in different cultures. arguemnt should address fear. The deaf looked for examples where connection to culture was lost.

top* home * academics
dorian miller, 1/29/2003