PC able to talk and listen

compuwhiz93

Member
hi there

I think it is time that the computer should be able to PERFECTLY understand the human through speech and respond back to him, like in movies (iron man even flubber :D).
I used the speech recognition software in vista but it still needs loads of improvement.

So i was wondering if any of you know about any software which helps in doing this, or atleast something that can make the PC have cool conversations with you and read texts with you (i know about narrator, very robotic).

Eagerly waiting for any discussions regarding this.

Compuwhiz93
 

My Computer

System One

  • Manufacturer/Model
    Toshiba Satellite L350D, Lenovo Thinkpad T510
    CPU
    AMD Athlon X2 QL-60 1.90GHZ
    Memory
    3.00GB, 4.00GB
    Graphics Card(s)
    ATI Radeon 256MB, Nividea NVS 512MB
    Screen Resolution
    1400x900
    Hard Drives
    200GB in the Laptop
    320 WD external HD
    2X 320 WD internal Hard drives which i use as external.
    320GB in my lenovo laptop
    Mouse
    Logitech wireless USB mouse
    Internet Speed
    10.00MB

Dwarf

The Contemplator
Vista Guru
Gold Member
Although great improvements have been made with speech recognition, home computers currently don't have the capacity to comprehend fully spoken language. For a start off, words can have a slightly different meaning depending on how it is pronounced, and the context in which it is used. In addition, the human brain can recognise the same word when pronounced in many different ways by people with differing accents and dialogue inflections. Unless the speech recognition system in a computer has already learnt the word, and the ways that it may be pronounced, it cannot understand a word if asked to recognise it through audio input. Even if it does know the word, it might not recognise it if it comes in a different accent and inflection.

As regards the computer outputting speech, it is currently very robotic because the number of voices available is very limited. Although the pitch and speed can be varied, this usually affects all of what is being output by the same amount, unlike normal human speech which can change in pitch and speed dynamically. Whilst certain punctuation can make the pitch of the word prior to it rise or fall, the end result is still very much robotic.

Unfortunately, home computers simply don't have the memory capacity or resources in order to be able to speak and to understand speech in the same way as a human brain can. Mainframe computers fair better because they have more memory and resources available, in addition to more processing power thanks to them supporting multiple CPUs (realistic speech comprehension and reproduction is CPU-intensive).
 

My Computer

System One

  • Manufacturer/Model
    Dwarf Dwf/11/2012 r09/2013
    CPU
    Intel Core-i5-3570K 4-core @ 3.4GHz (Ivy Bridge) (OC 4.2GHz)
    Motherboard
    ASRock Z77 Extreme4-M
    Memory
    4 x 4GB DDR3-1600 Corsair Vengeance CMZ8GX3M2A1600C9B (16GB)
    Graphics Card(s)
    MSI GeForce GTX770 Gaming OC 2GB
    Sound Card
    Realtek High Definition on board solution (ALC 898)
    Monitor(s) Displays
    ViewSonic VA1912w Widescreen
    Screen Resolution
    1440x900
    Hard Drives
    OCZ Agility 3 120GB SATA III x2 (RAID 0)
    Samsung HD501LJ 500GB SATA II x2
    Hitachi HDS721010CLA332 1TB SATA II
    Iomega 1.5TB Ext USB 2.0
    WD 2.0TB Ext USB 3.0
    PSU
    XFX Pro Series 850W Semi-Modular
    Case
    Gigabyte IF233
    Cooling
    1 x 120mm Front Inlet 1 x 120mm Rear Exhaust
    Mouse
    Microsoft Comfort Mouse 3000 for Business (USB)
    Keyboard
    Microsoft Comfort Curve Keyboard 3000 (USB)
    Internet Speed
    NetGear DG834Gv3 ADSL Modem/Router (Ethernet) ~4.0 Mb/s (O2)
    Other Info
    Optical Drive: HL-DT-ST BD-RE BH10LS30 SATA Bluray
    Lexmark S305 Printer/Scanner/Copier (USB)
    WEI Score: 8.1/8.1/8.5/8.5/8.25
    Asus Eee PC 1011PX Netbook (Windows 7 x86 Starter)

compuwhiz93

Member
Yeah a pity thought, looks like we still have a long way to go until we get that technology.
Anyways thanks a lot for the reply.
 

My Computer

System One

  • Manufacturer/Model
    Toshiba Satellite L350D, Lenovo Thinkpad T510
    CPU
    AMD Athlon X2 QL-60 1.90GHZ
    Memory
    3.00GB, 4.00GB
    Graphics Card(s)
    ATI Radeon 256MB, Nividea NVS 512MB
    Screen Resolution
    1400x900
    Hard Drives
    200GB in the Laptop
    320 WD external HD
    2X 320 WD internal Hard drives which i use as external.
    320GB in my lenovo laptop
    Mouse
    Logitech wireless USB mouse
    Internet Speed
    10.00MB

Shadowjk

Network Enthusiast
Vista Guru
Gold Member
It is funny that humans struggle doing things that computers find easy like answering calculations and that computers struggle at things that humans can do easily like speaking
 

My Computer

System One

  • Manufacturer/Model
    Custom Built
    CPU
    Intel Core i5 2400 @ 3.10GHz
    Motherboard
    Foxconn H67MP-S/-V/H67MP
    Memory
    8.0GB DDR3 @ 665MHz (2GBx4)
    Graphics Card(s)
    AMD HD Radeon 6870
    Sound Card
    Realtek High Definition Audio
    Monitor(s) Displays
    SMB1930NW ([email protected])
    Screen Resolution
    1280x800
    Hard Drives
    977GB Seagate ST31000524AS ATA Device (SATA) + 250GB WD iSCSI attached Drive
    Case
    Novatech Night
    Mouse
    Dell HID-compliant mouse
    Keyboard
    Standard PS/2 Keyboard

Dwarf

The Contemplator
Vista Guru
Gold Member
Humans tend to work best with analog data, whereas a computer is primarily a digital entity. Certain things lead themselves better towards digital interpretation, and others towards analogue. Some are at home in both worlds.

Calculations and text recognition (picking letters and/or words from a given piece of text) is something that computers find easy to do because both can be programmed digitally.

Speech is one of those things that is, by its nature, an analogue quantity. To the human brain, this is of no consequence, but to a computer it is a completely alien concept. The nearest that we can get to this is listening to someone in a foreign language. In order to understand and process the information, there has to be some form of conversion so that the brain and/or computer can understand. With languages, we typically learn this whilst we are at school, although for a good many of us much of this is lost when we grow older and we end up primarily using our home language. A computer needs to have an analog signal translated into its digital equivalent in order to process it. Here lies the main limitation. Conversion is done by means of quantitization, whereby an analog signal is sampled and assigned an equivalent digital value. The rate of sampling, and the number of quantitization steps between an input of 0% and one of 100% is of prime importance. The higher both of these values is, the more accurate the processing of the signal becomes. Unfortunately, this is its limitation, and it is why I stated in my earlier post that realistic speech comprehension and reproduction is CPU-intensive.
 

My Computer

System One

  • Manufacturer/Model
    Dwarf Dwf/11/2012 r09/2013
    CPU
    Intel Core-i5-3570K 4-core @ 3.4GHz (Ivy Bridge) (OC 4.2GHz)
    Motherboard
    ASRock Z77 Extreme4-M
    Memory
    4 x 4GB DDR3-1600 Corsair Vengeance CMZ8GX3M2A1600C9B (16GB)
    Graphics Card(s)
    MSI GeForce GTX770 Gaming OC 2GB
    Sound Card
    Realtek High Definition on board solution (ALC 898)
    Monitor(s) Displays
    ViewSonic VA1912w Widescreen
    Screen Resolution
    1440x900
    Hard Drives
    OCZ Agility 3 120GB SATA III x2 (RAID 0)
    Samsung HD501LJ 500GB SATA II x2
    Hitachi HDS721010CLA332 1TB SATA II
    Iomega 1.5TB Ext USB 2.0
    WD 2.0TB Ext USB 3.0
    PSU
    XFX Pro Series 850W Semi-Modular
    Case
    Gigabyte IF233
    Cooling
    1 x 120mm Front Inlet 1 x 120mm Rear Exhaust
    Mouse
    Microsoft Comfort Mouse 3000 for Business (USB)
    Keyboard
    Microsoft Comfort Curve Keyboard 3000 (USB)
    Internet Speed
    NetGear DG834Gv3 ADSL Modem/Router (Ethernet) ~4.0 Mb/s (O2)
    Other Info
    Optical Drive: HL-DT-ST BD-RE BH10LS30 SATA Bluray
    Lexmark S305 Printer/Scanner/Copier (USB)
    WEI Score: 8.1/8.1/8.5/8.5/8.25
    Asus Eee PC 1011PX Netbook (Windows 7 x86 Starter)

Dwarf

The Contemplator
Vista Guru
Gold Member
In addition to what I have said above, you need to remember that certain words, although spelt differently, can sound exactly the same, and it is only the context in which they are used that differentiates the spelling that we use (if writing it down) and its meaning.

For example: Where (as in place), Wear (as in clothing, or eroding), and Ware (as in items) all sound exactly the same, and a computer would have a great deal of difficulty in deciphering which particular meaning that you wish to convey.
 

My Computer

System One

  • Manufacturer/Model
    Dwarf Dwf/11/2012 r09/2013
    CPU
    Intel Core-i5-3570K 4-core @ 3.4GHz (Ivy Bridge) (OC 4.2GHz)
    Motherboard
    ASRock Z77 Extreme4-M
    Memory
    4 x 4GB DDR3-1600 Corsair Vengeance CMZ8GX3M2A1600C9B (16GB)
    Graphics Card(s)
    MSI GeForce GTX770 Gaming OC 2GB
    Sound Card
    Realtek High Definition on board solution (ALC 898)
    Monitor(s) Displays
    ViewSonic VA1912w Widescreen
    Screen Resolution
    1440x900
    Hard Drives
    OCZ Agility 3 120GB SATA III x2 (RAID 0)
    Samsung HD501LJ 500GB SATA II x2
    Hitachi HDS721010CLA332 1TB SATA II
    Iomega 1.5TB Ext USB 2.0
    WD 2.0TB Ext USB 3.0
    PSU
    XFX Pro Series 850W Semi-Modular
    Case
    Gigabyte IF233
    Cooling
    1 x 120mm Front Inlet 1 x 120mm Rear Exhaust
    Mouse
    Microsoft Comfort Mouse 3000 for Business (USB)
    Keyboard
    Microsoft Comfort Curve Keyboard 3000 (USB)
    Internet Speed
    NetGear DG834Gv3 ADSL Modem/Router (Ethernet) ~4.0 Mb/s (O2)
    Other Info
    Optical Drive: HL-DT-ST BD-RE BH10LS30 SATA Bluray
    Lexmark S305 Printer/Scanner/Copier (USB)
    WEI Score: 8.1/8.1/8.5/8.5/8.25
    Asus Eee PC 1011PX Netbook (Windows 7 x86 Starter)

ilikefree

Vista Guru
Gold Member

My Computer

System One

  • Manufacturer/Model
    Lenovo Thinkpad T400
    CPU
    Intel Mobile Core 2 Duo P8700 @ 2.53GHz
    Motherboard
    LENOVO 64734VM
    Memory
    2.00GB Single-Channel DDR3 @ 531MHz
    Graphics Card(s)
    Intel Mobile Intel 4 Series Express Chipset Family
    Sound Card
    Conexant 20561 SmartAudio HD
    Monitor(s) Displays
    15 inch
    Screen Resolution
    1280 x 800
    Hard Drives
    1x 180GB Intel 530 series SSD
    1 x 120GB Hitachi 5400rmp
    1 x 650GB Western Digital Elements 5400rpm
    1x 1Tb Western Digital Elements 5400rpm
    Internet Speed
    Medium for New Zealand
    Other Info
    Weakest part of my computer is the graphics chipset.
    Only ever used a laptop.
    Also use USB Freeview TV Card
    Lenovo Docking Station
    External Speakers
    Other bits a pieces as needed

compuwhiz93

Member
Thanks, i'll give it a try and tell you.
 

My Computer

System One

  • Manufacturer/Model
    Toshiba Satellite L350D, Lenovo Thinkpad T510
    CPU
    AMD Athlon X2 QL-60 1.90GHZ
    Memory
    3.00GB, 4.00GB
    Graphics Card(s)
    ATI Radeon 256MB, Nividea NVS 512MB
    Screen Resolution
    1400x900
    Hard Drives
    200GB in the Laptop
    320 WD external HD
    2X 320 WD internal Hard drives which i use as external.
    320GB in my lenovo laptop
    Mouse
    Logitech wireless USB mouse
    Internet Speed
    10.00MB
Top