Methods, apparatuses, and computer program products for anomaly detection are provided. For example, a computer-implemented method includes receiving data corresponding to a plurality of metrics from a plurality of devices, inputting at least a first portion of the received data covering a first time period into a data model to train the data model, inputting at least a second portion of the received data covering a second time period into the trained data model, identifying from the data model one or more anomalies corresponding to one or more of the plurality of metrics of one or more of the plurality of devices, determining if a predefined filter exists for the one or more of the plurality of metrics corresponding to each of the one or more anomalies identified, and applying the predefined filter to determine if the corresponding identified anomaly will be recorded as a reportable anomaly.
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
G10L 25/78 - Detection of presence or absence of voice signals
09 - Scientific and electric apparatus and instruments
Goods & Services
Computer software for management via the internet of
voice-based computer systems; computer software for
interfacing with computer systems utilizing voice
applications.
4.
SYSTEMS, APPARATUSES, METHODS, AND COMPUTER PROGRAM PRODUCTS FOR INITIATING PERFORMANCE OF ONE OR MORE ENTERPRISE OPERATIONS ACTIONS
Systems, apparatuses, methods, and computer program products are provided herein. For example, an embodiment described herein includes an enterprise operations system. In some embodiments, the enterprise operations system may include an edge enterprise operations device configured to receive at least a first portion of enterprise implementation data from at least one of the one or more edge supported wearable devices. In some embodiments, the edge enterprise operations device may be configured to generate, based at least in part on applying the server enterprise operations data and the enterprise implementation data to a composite edge enterprise operations machine learning model, edge enterprise operations data. In some embodiments, the edge enterprise operations device may be configured to initiate performance of one or more edge enabled enterprise operations actions based at least in part on the edge enterprise operations data.
H04L 67/1001 - Protocols in which an application is distributed across nodes in the network for accessing one among a plurality of replicated servers
H04L 67/12 - Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks
09 - Scientific and electric apparatus and instruments
Goods & Services
(1) Computer software for management via the internet of voice-based computer systems; computer software for interfacing with computer systems utilizing voice applications.
7.
DISTINGUISHING USER SPEECH FROM BACKGROUND SPEECH IN SPEECH-DENSE ENVIRONMENTS
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
G10L 25/78 - Detection of presence or absence of voice signals
A method for identifying a second device by a first device for establishing a communication between the first device and the second device is described here. The method includes receiving, by a processor of a first device, a voice command from a worker in a workplace. In an example, the method comprises pausing, by the processor, a workflow operation executing on the first device. The method further comprises performing, by the processor, a voice recognition to analyze the voice command of the worker. The method includes activating, by the processor, a communication module of the first device based on the voice recognition, to identify a second device in proximity to the first device. The method includes terminating, by the processor, a connection between the first device and the wearable electronic device. Thus, terminating, by the processor, a second connection of the first device with the second device.
G06F 9/48 - Program initiatingProgram switching, e.g. by interrupt
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/30 - Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
H04B 5/72 - Near-field transmission systems, e.g. inductive or capacitive transmission systems specially adapted for specific purposes for local intradevice communication
H04M 1/72412 - User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories using two-way short-range wireless interfaces
H04W 4/80 - Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
9.
Distinguishing user speech from background speech in speech-dense environments
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
G10L 25/78 - Detection of presence or absence of voice signals
10.
Systems and Methods for Dynamically Improving User Intelligibility of Synthesized Speech in a Work Environment
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G10L 15/16 - Speech classification or search using artificial neural networks
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
G10L 25/78 - Detection of presence or absence of voice signals
12.
Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
The present application provides methods, apparatuses, and/or the like that are configured to accurately and efficiently train a speech recognition speech rate model to generate an optimized digital signal from speech input of a user for decoding. For example, the decoding includes accessing a hint data object at or around the time of the speech input to determine an output hypothesis from a plurality of hypotheses that matches the speech input.
A worker resource management system may include a voice-directed mobile terminal that enables a dialog between a user and the voice-directed mobile terminal. At least one computer may be in communication with the mobile terminal. The computer can include a worker resource management module that receives and records user activity from the voice-directed mobile terminal. The worker resource management module can identify user productivity patterns and provide work assessment predictions based at least in part upon the user activity that is received and recorded. Management can make worker resource decisions in response to the user productivity patterns identified or the work assessment predictions provided by the worker resource management module.
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
Provided is an improved apparatus and method for identifying inefficient workers in a workplace. The apparatus receives voice commands from a worker in the workplace and transforms the voice commands into text commands by converting audio signals associated with the voice commands to one or more points in data that are stored in a database. The apparatus further creates a dialogue-stream including the text commands and a time of receipt of each voice command associated with the respective text command. Further, a workflow record may be created from the dialogue-stream and may be used to generate an efficiency metric for the worker. Both the dialogue-stream and workflow record can be used to monitor the efficiency of the worker and implement effective changes to the workflow process.
A method for identifying a second device by a first device for establishing a communication between the first device and the second device is described here. The method includes receiving, by a processor of a first device, a voice command from a worker in a workplace. In an example, the method comprises pausing, by the processor, a workflow operation executing on the first device. The method further comprises performing, by the processor, a voice recognition to analyze the voice command of the worker. The method includes activating, by the processor, a communication module of the first device based on the voice recognition, to identify a second device in proximity to the first device. The method includes terminating, by the processor, a connection between the first device and the wearable electronic device. Thus, terminating, by the processor, a second connection of the first device with the second device.
G06F 9/48 - Program initiatingProgram switching, e.g. by interrupt
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/30 - Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
H04B 5/00 - Near-field transmission systems, e.g. inductive or capacitive transmission systems
H04M 1/72412 - User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories using two-way short-range wireless interfaces
H04W 4/80 - Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
18.
SYSTEMS AND METHODS FOR PROVIDING REAL-TIME ASSISTANCE
A method for providing real-time information to a first electronic device is described here. The method includes transmitting one or more workflow tasks from a host computer to a first electronic device communicatively coupled to the host computer. Further, the method comprises receiving sensor information from a plurality of sensors. The sensor information comprises at least one of a location of the first electronic device or data related to the one or more workflow tasks being performed by a worker. The method further comprises transmitting, to the first electronic device, a real-time information to be communicated to the worker based on the sensor information received from the plurality of sensors. In some examples, the real-time information comprises instructions in form of an assistance to be provided to the worker for performing the one or more workflow tasks.
G09B 5/02 - Electrically-operated educational appliances with visual presentation of the material to be studied, e.g. using film strip
G08B 7/06 - Signalling systems according to more than one of groups Personal calling systems according to more than one of groups using electric transmission
H04Q 9/00 - Arrangements in telecontrol or telemetry systems for selectively calling a substation from a main station, in which substation desired apparatus is selected for applying a control signal thereto or for obtaining measured values therefrom
A method for identifying a second device by a first device for establishing a communication between the first device and the second device is described here. The method includes receiving, by a processor of a first device, a voice command from a worker in a workplace. In an example, the method comprises pausing, by the processor, a workflow operation executing on the first device. The method further comprises performing, by the processor, a voice recognition to analyze the voice command of the worker. The method includes activating, by the processor, a communication module of the first device based on the voice recognition, to identify a second device in proximity to the first device. The method includes terminating, by the processor, a connection between the first device and the wearable electronic device. Thus, terminating, by the processor, a second connection of the first device with the second device.
G06F 9/48 - Program initiatingProgram switching, e.g. by interrupt
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 15/30 - Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
H04W 4/80 - Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
H04B 5/00 - Near-field transmission systems, e.g. inductive or capacitive transmission systems
H04M 1/72412 - User interfaces specially adapted for cordless or mobile telephones with means for local support of applications that increase the functionality by interfacing with external accessories using two-way short-range wireless interfaces
Various embodiments described herein relate to determining and providing user-specific feedback based on an analysis of audible input sessions performed by a user. In this regard, a set of term recognition structures that each comprise a plurality of term data objects and a respective confidence score for each term data object are generated. For at least one pairing of term data objects of a predefined term glossary, a correlation coefficient value for the respective pairing is determined. In accordance with determining that the correlation coefficient value for the at least one pairing satisfies a predefined threshold a generate a visualization is generated and displayed that includes an indication of the term data objects of the at least one pairing.
G10L 15/01 - Assessment or evaluation of speech recognition systems
G06T 11/20 - Drawing from basic elements, e.g. lines or circles
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G10L 25/06 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being correlation coefficients
21.
Enabling workers to use a personal mobile device with a wearable electronic device
A method for facilitating picking of an incoming call on a device is described here. The method includes transmitting, by a processor of a work device, a workflow execution command to a first device communicatively coupled to the work device. In an example, the method comprises receiving at the work device, a first message indicative of occurrence of an event on a second device communicatively coupled to the work device. The method further comprises sending, by the processor, a first request to the first device to terminate a first connection with the work device. The method includes sending, by the processor, a second request to communicatively couple the first device to the second device. The method includes receiving, by the processor, a second message indicative of completion of the event on the second device. Thus, terminating, by the processor, a second connection of the first device with the second device.
H04W 76/15 - Setup of multiple wireless link connections
H04W 76/34 - Selective release of ongoing connections
H04W 4/80 - Services using short range communication, e.g. near-field communication [NFC], radio-frequency identification [RFID] or low energy communication
22.
Systems, methods, and apparatuses for improving performance of executing a workflow operation
A voice controlled apparatus for performing a workflow operation is described. The voice controlled apparatus can include a microphone, a speaker, and a processor. In some examples, the voice controlled apparatus can generate, via the speaker, a voice prompt associated with a task of a workflow and identify, via the microphone, a voice response received from a worker. In this regard, the voice prompt and the voice response can be a part of a voice dialogue. Further, the processor of the voice controlled apparatus can identify a performance status associated with the execution of the task, before providing a next voice prompt subsequent to the voice prompt. In this aspect, the performance status can be identified based on analyzing the voice dialogue using a machine learning model. Furthermore, the voice controlled apparatus can generate a message including a suggestion to improve the performance status of the task.
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
A worker resource management system may include a voice-directed mobile terminal that enables a dialog between a user and the voice-directed mobile terminal. At least one computer may be in communication with the mobile terminal. The computer can include a worker resource management module that receives and records user activity from the voice-directed mobile terminal. The worker resource management module can identify user productivity patterns and provide work assessment predictions based at least in part upon the user activity that is received and recorded. Management can make worker resource decisions in response to the user productivity patterns identified or the work assessment predictions provided by the worker resource management module.
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
Provided herein are a multi-microphone system and method including a controller, a plurality of transducers each operable within a unique sensitivity range, and corresponding microphone units. The controller receives a sound signal output from a first microphone unit that corresponds to a microphone unit having a transducer with the highest sensitivity. The controller analyzes the sound signal output to identify a first parameter of the sound signal output and determines if the first parameter satisfies pre-defined criteria. In an instance in which the first parameter satisfies the pre-defined criteria, the controller outputs the sound signal output of the selected first microphone unit as the output of the multi-microphone system. Otherwise, the controller receives a sound signal output from a second microphone unit comprising a corresponding transducer with a sensitivity less than the first microphone unit but greater than remaining transducers.
H04R 1/40 - Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
Provided is an improved apparatus and method for identifying inefficient workers in a workplace. The apparatus receives voice commands from a worker in the workplace and transforms the voice commands into text commands by converting audio signals associated with the voice commands to one or more points in data that are stored in a database. The apparatus further creates a dialogue-stream including the text commands and a time of receipt of each voice command associated with the respective text command. Further, a workflow record may be created from the dialogue-stream and may be used to generate an efficiency metric for the worker. Both the dialogue-stream and workflow record can be used to monitor the efficiency of the worker and implement effective changes to the workflow process.
A method of characterizing inventory items to a planogram includes receiving images of view of a scene with inventory items and corresponding indicia using an imaging device; receiving utterances (such as spoken description of inventory items) from a user using a voice recognition system; identifying inventory items in the scene and a corresponding identification code based at least in part on the images and at least in part on the utterances; identifying a plurality of attributes corresponding to the inventory items; and characterizing the inventory items to a planogram based on the respective identification code and plurality of attributes.
G06Q 10/08 - Logistics, e.g. warehousing, loading or distributionInventory or stock management
G06K 7/10 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation
G06K 7/14 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation using light without selection of wavelength, e.g. sensing reflected white light
30.
Communication headsets and systems for mobile application control and power savings
Headset assemblies and systems for mobile application control and power savings are provided. Headset assembly includes headband with proximity sensor activation element and receiver assembly configured to be rotated with respect to the headband between a first position and a second position. Receiver assembly has memory, proximity sensor, processor communicatively coupled to memory and proximity sensor, microphone boom having first and second ends, and microphone located adjacent first end and speaker located adjacent second end. Processor comprises PIO port. Proximity sensor activation element is positioned to trigger proximity sensor to activate PIO port when receiver assembly is rotated to the second position and to deactivate PIO port when receiver assembly is rotated to first position. Proximity sensor is configured to transmit at least one signal representing receiver assembly position.
A method to provide an order for a user to select goods includes automatically prompting a user via a voice command to collect an item from an order placed by a customer, and in response to receiving the input from an electronic device indicating that the item is unavailable, automatically prompting the user via a voice command to collect a substitute item designated, receiving an input indicating that the substitute item has been collected, followed by electronically notifying the customer that the substitute item has been collected instead of the original item. In another aspect, a voice-enabled click and collect system includes a remote server in communication with a mobile device, configured to prompt a user to collect an item, communicate the item unavailability to the remote server, receive instructions from the remote server, prompt the user to collect the substitute item, and notify the customer.
A method for text-to-speech (TTS) pronunciation correction includes initiating a correction of a pronunciation error produced by a TTS system; receiving an input from a user for the correction of the pronunciation error; and updating the pronunciation based on the input provided by the user. The user input can include providing one or more voice recording entries, a manual entry from a keyboard, and/or choosing an entry from a predetermined list of phonetic strings. The method can also include pronouncing the corrected pronunciation entry back to the user. The user input can be further used for correction of similar syllables, words, and/or phrases.
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
Methods and apparatuses to assist a user in the performance of a plurality of tasks are provided. The invention includes storing at least one care plan for a resident, the care plan defining a plurality of tasks to be performed for providing care to the resident. The method includes capturing speech inputs from the user and providing speech outputs to the user to provide a speech dialog with the user reflective of the care plan. Information is captured with a contactless communication interface and is used for engaging the care plan.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G06F 19/00 - Digital computing or data processing equipment or methods, specially adapted for specific applications (specially adapted for specific functions G06F 17/00;data processing systems or methods specially adapted for administrative, commercial, financial, managerial, supervisory or forecasting purposes G06Q;healthcare informatics G16H)
G16H 10/20 - ICT specially adapted for the handling or processing of patient-related medical or healthcare data for electronic clinical trials or questionnaires
G16H 40/20 - ICT specially adapted for the management or administration of healthcare resources or facilitiesICT specially adapted for the management or operation of medical equipment or devices for the management or administration of healthcare resources or facilities, e.g. managing hospital staff or surgery rooms
34.
Communication headsets and systems for mobile application control and power savings
Headset assemblies and systems for mobile application control and power savings are provided. Headset assembly includes headband with proximity sensor activation element and receiver assembly configured to be rotated with respect to the headband between a first position and a second position. Receiver assembly has memory, proximity sensor, processor communicatively coupled to memory and proximity sensor, microphone boom having first and second ends, and microphone located adjacent first end and speaker located adjacent second end. Processor comprises PIO port. Proximity sensor activation element is positioned to trigger proximity sensor to activate PIO port when receiver assembly is rotated to the second position and to deactivate PIO port when receiver assembly is rotated to first position. Proximity sensor is configured to transmit at least one signal representing receiver assembly position.
A portable terminal has a network interface that receives a set of instructions having a sequence of at least one location and audio properties associated with the at least one location from a server. An audio circuit receives audio signals picked up by a microphone and processes the audio signals in a manner defined by the audio properties associated with the at least one location. A speech recognition module receives processed signals from the audio circuit and carries out a speech recognition process thereupon.
A device, system, and method whereby a speech-driven system used in an industrial environment distinguishes speech obtained from users of the system from other background sounds. In one aspect, the present system and method provides for a first audio stream from a user microphone collocated with a source of human speech (that is, a user) and a second audio stream from a environmental microphone which is proximate to the source of human speech but more remote than the user microphone. The audio signals from the two microphones are asynchronous. A processor is configured to identify a common, distinctive sound event in the environment, such as an impulse sound or a periodic sound signal. Based on the common sound event, the processor provides for synchronization of the two audio signals. In another aspect, the present system and method provides for a determination of whether or not the sound received at the user microphone is suitable for identification of words in a human voice, based on a comparison of sound elements in the first audio stream and the second audio stream, for example based on a comparison of the sound intensities of the sound elements in the audio streams.
G10L 19/00 - Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocodersCoding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
G10L 21/0264 - Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
A method improves speech recognition using a device located in proximity to a machine emitting high levels of audio noise. The microphone of the device receives the audio noise emitted by the machine and the speech emitted by a user and generates a composite signal. The device also receives a wireless communication signal from the machine comprising information on an audio noise profile and the proximity of the machine relative to the device. The audio noise profile is a representation of the audio noise emitted by the machine. Based on this information, the device determines a filter for filtering the composite signal to mitigate the audio noise before initiating the speech recognition process. The method improves speech recognition in a high audio noise environment.
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 25/21 - Speech or voice analysis techniques not restricted to a single one of groups characterised by the type of extracted parameters the extracted parameters being power information
G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/0216 - Noise filtering characterised by the method used for estimating noise
Systems and methods for monitoring worker movement within a warehouse setting are provided. According to one implementation, a wearable apparatus comprises an audio unit configured to receive audio input signals from a user and convey audio output signals to the user. Also, the wearable apparatus includes a step count unit configured to detect walking movements of the user.
A device, system, and method whereby a speech-driven system can distinguish speech obtained from users of the system from other speech spoken by background persons, as well as from background speech from public address systems. In one aspect, the present system and method prepares, in advance of field-use, a voice-data file which is created in a training environment. The training environment exhibits both desired user speech and unwanted background speech, including unwanted speech from persons other than a user and also speech from a PA system. The speech recognition system is trained or otherwise programmed to identify wanted user speech which may be spoken concurrently with the background sounds. In an embodiment, during the pre-field-use phase the training or programming may be accomplished by having persons who are training listeners audit the pre-recorded sounds to identify the desired user speech. A processor-based learning system is trained to duplicate the assessments made by the human listeners.
G06Q 20/40 - Authorisation, e.g. identification of payer or payee, verification of customer or shop credentialsReview and approval of payers, e.g. check of credit lines or negative lists
G10L 25/84 - Detection of presence or absence of voice signals for discriminating voice from noise
G10L 25/51 - Speech or voice analysis techniques not restricted to a single one of groups specially adapted for particular use for comparison or discrimination
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
A mobile device can receive information from a computer; at least partially in response to the receiving of the information from the computer, the mobile device can communicate with a worker regarding a product type; the mobile device can receive information from the worker via voice, wherein the information received from the worker can identify a date or other product information associated with a product of the product type; and the mobile device can provide the product information to the computer. The mobile device can provide a second voice prompt to the worker, wherein the second voice prompt can request a quantity of products of the product type that are marked with the product information; then the mobile device can receive a quantity from the worker via voice; and the mobile device can report to the computer.
Systems/Methods are provided for reducing picking operation errors. Task instruction is received about an item to be picked. A unique identifier is received for a pick location where the item is located. A unique identifier signal is received from a computing device at the pick location. A sensor thereof is commanded to output a signal in response to detecting unique identifier signal. The outputted signal represents an ambient magnetic field measurement in the sensor's vicinity received and monitored to detect change. Vehicle motion information is received from a motion detector. A notification whether the vehicle is at the correct pick location where item is located or the incorrect pick location is outputted.
G06K 7/14 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation using light without selection of wavelength, e.g. sensing reflected white light
G06K 7/10 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation
A system and method for identifying a specific RFID tag includes RFID reader circuitry, such as within an RFID reader, configured for sending and receiving RF signals to detect RFID tags and for obtaining signal parameter information associated with the RFID tags. Processing circuitry is configured for using the signal parameter information for one or more tags of the RFID tags and calculating a tag score for the one or more RFID tags. The processing circuitry is further configured for determining a specific RFID tag using the tag scores for the one or more RFID tags.
H04Q 5/22 - Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange with indirect connection, i.e. through subordinate switching centre the subordinate centre not permitting interconnection of subscribers connected thereto
G06K 7/10 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation
G06K 7/00 - Methods or arrangements for sensing record carriers
G06Q 10/08 - Logistics, e.g. warehousing, loading or distributionInventory or stock management
46.
Systems and methods for worker resource management
A worker resource management system may include a voice-directed mobile terminal that enables a dialog between a user and the voice-directed mobile terminal. At least one computer may be in communication with the mobile terminal. The computer can include a worker resource management module that receives and records user activity from the voice-directed mobile terminal. The worker resource management module can identify user productivity patterns and provide work assessment predictions based at least in part upon the user activity that is received and recorded. Management can make worker resource decisions in response to the user productivity patterns identified or the work assessment predictions provided by the worker resource management module.
A method, apparatus, and program product for generating check data for a location within an area of a workspace include receiving an identifier for a selected location that has check data associated therewith. Candidate check data for use with the selected location is generated. The candidate check data is evaluated for a match against at least one of existing check data for the selected location or check data associated with a related location. Based on the evaluation, a determination is made of whether the candidate check data is acceptable for use for the selected location.
Methods and apparatuses to assist a user in the performance of a plurality of tasks are provided. The invention includes storing at least one care plan for a resident, the care plan defining a plurality of tasks to be performed for providing care to the resident. The method includes capturing speech inputs from the user and providing speech outputs to the user to provide a speech dialog with the user reflective of the care plan. Information is captured with a contactless communication interface and is used for engaging the care plan.
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G06F 19/00 - Digital computing or data processing equipment or methods, specially adapted for specific applications (specially adapted for specific functions G06F 17/00;data processing systems or methods specially adapted for administrative, commercial, financial, managerial, supervisory or forecasting purposes G06Q;healthcare informatics G16H)
A method and system of managing power usage of devices including selectively executing a program application on a plurality of battery powered devices. Battery usage data is generated for a battery in one or more of the devices during execution of the work application. The battery usage data includes the run-time of the battery for the work application being executed. The data is aggregated and stored for the plurality of devices in memory. An application specific battery profile is generated using the stored battery usage data. The application specific battery profile is associated with the work application being run by the client devices.
Embodiments of the invention address the deficiencies of the prior art by providing a method, apparatus, and program product to of converting components of a web page to voice prompts for a user. In some embodiments, the method comprises selectively determining at least one HTML component from a plurality of HTML components of a web page to transform into a voice prompt for a mobile system based upon a voice attribute file associated with the web page. The method further comprises transforming the at least one HTML component into parameterized data suitable for use by the mobile system based upon at least a portion of the voice attribute file associated with the at least one HTML component and transmitting the parameterized data to the mobile system.
G10L 13/00 - Speech synthesisText to speech systems
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 13/02 - Methods for producing synthetic speechSpeech synthesisers
G06F 17/22 - Manipulating or registering by use of codes, e.g. in sequence of text characters
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
53.
Method and system for correctly identifying specific RFID tags
A system and method for identifying a specific RFID tag includes RFID reader circuitry, such as within an RFID reader, configured for sending and receiving RF signals to detect RFID tags and for obtaining signal parameter information associated with the RFID tags. Processing circuitry is configured for using the signal parameter information for one or more tags of the RFID tags and calculating a tag score for the one or more RFID tags. The processing circuitry is further configured for determining a specific RFID tag using the tag scores for the one or more RFID tags.
G06K 7/00 - Methods or arrangements for sensing record carriers
G06K 7/10 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation
G06Q 10/08 - Logistics, e.g. warehousing, loading or distributionInventory or stock management
54.
Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
A method and apparatus that dynamically adjust operational parameters of a text-to-speech engine in a speech-based system are disclosed. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 13/02 - Methods for producing synthetic speechSpeech synthesisers
G10L 13/033 - Voice editing, e.g. manipulating the voice of the synthesiser
55.
System and method for generating and updating location check digits
A method, apparatus, and program product for generating check data for a location within an area of a workspace include receiving an identifier for a selected location that has check data associated therewith. Candidate check data for use with the selected location is generated. The candidate check data is evaluated for a match against at least one of existing check data for the selected location or check data associated with a related location. Based on the evaluation, a determination is made of whether the candidate check data is acceptable for use for the selected location.
A self calibrating dipole microphone formed from two omni-directional acoustic sensors. The microphone includes a sound source acoustically coupled to the acoustic sensors and a processor. The sound source is excited with a test signal, exposing the acoustic sensors to acoustic calibration signals. The responses of the acoustic sensors to the calibration signals are compared by the processor, and one or more correction factors determined. Digital filter coefficients are calculated based on the one or more correction factors, and applied to the output signals of the acoustic sensors to compensate for differences in the sensitivities of the acoustic sensors. The filtered signals provide acoustic sensor outputs having matching responses, which are subtractively combined to form the dipole microphone output.
A system and method for supplying power to a headset, and for transmitting multiple signals generated in the headset to a terminal using frequency division multiplexing. An audio signal and a carrier signal are generated in the terminal and summed together to form a composite uplink signal. The composite uplink signal is provided to a headset over a first physical channel. At the headset, the audio and carrier signals are separated, and the carrier signal is used to generate power in the headset. Signals generated by a plurality of acoustic sensors in the headset are combined using frequency division multiplexing to generate a composite downlink signal, which is transmitted to the terminal over a second physical channel. One or more carrier signals used to generate the composite downlink signal are provided by either a carrier source in the headset, or by recovering the carrier signal from the composite uplink signal.
A method and apparatus are provided for allowing a user to change the stiffness of their headband, thus changing the clamping force of the headset. The section properties of the headband equivalent spring, which is a cantilevered beam, are changed by engaging a secondary spring, which is also a shorter cantilevered beam. The change is achieved by manipulating the effective length of the secondary spring, which can be accomplished with a set of clamps. Thus, the stiffness of the headband can be modified to the desire of the user.
A communication component modifies production of an audio waveform at determined modification segments to thereby mitigate the effects of a delay in processing and/or receiving a subsequent audio waveform. The audio waveform and/or data associated with the audio waveform are analyzed to identify the modification segments based on characteristics of the audio waveform and/or data associated therewith. The modification segments show where the production of the audio waveform may be modified without substantially affecting the clarity of the sound or audio. In one embodiment, the invention modifies the sound production at the identified modification segments to extend production time and thereby mitigate the effects of delay in receiving and/or processing a subsequent audio waveform for production.
G10L 21/047 - Time compression or expansion by changing speed using thinning out or insertion of a waveform characterised by the type of waveform to be thinned out or inserted
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
60.
System and method for improving speech recognition accuracy in a work environment
Apparatus and method that improves speech recognition accuracy, by monitoring the position of a user's headset-mounted speech microphone, and prompting the user to reconfigure the speech microphone's orientation if required. A microprocessor or other application specific integrated circuit provides a mechanism for comparing the relative transit times between a user's voice, a primary speech microphone, and a secondary compliance microphone. The difference in transit times may be used to determine if the speech microphone is placed in an appropriate proximity to the user's mouth. If required, the user is automatically prompted to reposition the speech microphone.
A power delivery method and system for powering a headset. A power signal is combined with an audio signal to form a composite signal that is communicated over a shared channel to the headset. The power signal is generated by modulating a carrier signal with a modulation signal. The modulation signal is derived from the amplitude of the audio signal so that the peak levels of the composite signal do not exceed the maximum allowable output of an audio I/O circuit driving the headset.
A system and method for identifying a specific RFID tag includes RFID reader circuitry, such as within an RFID reader, configured for sending and receiving RF signals to detect RFID tags and for obtaining signal parameter information associated with the RFID tags. Processing circuitry is configured for using the signal parameter information for one or more tags of the RFID tags and calculating a tag score for the one or more RFID tags. The processing circuitry is further configured for determining a specific RFID tag using the tag scores for the one or more RFID tags.
H04Q 5/22 - Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange with indirect connection, i.e. through subordinate switching centre the subordinate centre not permitting interconnection of subscribers connected thereto
G06K 7/10 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation
G06K 7/00 - Methods or arrangements for sensing record carriers
G06Q 10/08 - Logistics, e.g. warehousing, loading or distributionInventory or stock management
A method and system for remotely managing mobile devices over a communication network including transceiving user input data generated at a mobile management terminal to a management computer over a communication network. The transceived user input data is analyzed to determine whether the user input data corresponds to a stored command function and the stored command function is executed at the management computer in response to determining that the user input data corresponds to a stored command function. Data is transceived from the management computer to one or more mobile host devices over the communication network.
Receiving application specific individual battery adjusted battery use profile data upon loading of work application for managing remaining power of a mobile device
A method and system of managing power usage of devices including selectively executing a program application on a plurality of battery powered devices. Battery usage data is generated for a battery in one or more of the devices during execution of the work application. The battery usage data includes the run-time of the battery for the work application being executed. The data is aggregated and stored for the plurality of devices in memory. An application specific battery profile is generated using the stored battery usage data. The application specific battery profile is associated with the work application being run by the client devices.
A system and method for the operation and navigation of a moving vehicle in a warehouse environment includes a controller for controlling operation of the vehicle and an imaging device configured to capture an image representing a view from the vehicle. Location labels are configured for being positioned in the warehouse environment and include location indicia.
A navigation processor is operatively coupled to the imaging device and configured to determine a location of the vehicle based at least in part on the image of the location label, the navigation processor further coupled with the controller to control the movement of the vehicle based on the determined location.
G01C 22/00 - Measuring distance traversed on the ground by vehicles, persons, animals or other moving solid bodies, e.g. using odometers or using pedometers
G05D 1/00 - Control of position, course, altitude or attitude of land, water, air or space vehicles, e.g. using automatic pilots
G05D 1/02 - Control of position or course in two dimensions
A system and method for supplying power to a headset, and for transmitting multiple signals generated in the headset to a terminal using frequency division multiplexing. An audio signal (66) and a carrier signal (68) are generated in the terminal and summed together to form a composite uplink signal (70). The composite uplink signal (70) is provided to a headset (14) over a first physical channel (28). At the headset (14), the audio and carrier signals are separated, and the carrier signal (68) is used to generate power in the headset. Signals generated by a plurality of acoustic sensors (60, 62) in the headset are combined using frequency division multiplexing to generate a composite downlink signal (76), which is transmitted to the terminal over a second physical channel.
Method, apparatus and program product for generating check data for a location within an area of a workspace include receiving an identifier for a selected location that has check data associated therewith. Candidate check data for use with the selected location is generated. The candidate check data is evaluated for a match against at least one of existing check data for the selected location or check data associated with a related location. Based on the evaluation, a determination is made of whether the candidate check data is acceptable for use for the selected location.
Method and apparatus that dynamically adjusts operational parameters of a text-to-speech engine in a speech-based system. A voice engine or other application of a device provides a mechanism to alter the adjustable operational parameters of the text-to-speech engine. In response to one or more environmental conditions, the adjustable operational parameters of the text-to-speech engine are modified to increase the intelligibility of synthesized speech.
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
G10L 13/00 - Speech synthesisText to speech systems
G10L 15/20 - Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise or of stress induced speech
G10L 21/02 - Speech enhancement, e.g. noise reduction or echo cancellation
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G10L 19/14 - Details not provided for in groups ; G10L 19/06-G10L 19/12, e.g. gain coding, post filtering design or vocoder structure
G10L 17/00 - Speaker identification or verification techniques
A self calibrating dipole microphone formed from two omni-directional acoustic sensors. The microphone includes a sound source acoustically coupled to the acoustic sensors and a processor. The sound source is excited with a test signal, exposing the acoustic sensors to acoustic calibration signals. The responses of the acoustic sensors to the calibration signals are compared by the processor, and one or more correction factors determined. Digital filter coefficients are calculated based on the one or more correction factors, and applied to the output signals of the acoustic sensors to compensate for differences in the sensitivities of the acoustic sensors. The filtered signals provide acoustic sensor outputs having matching responses, which are subtractively combined to form the dipole microphone output.
09 - Scientific and electric apparatus and instruments
Goods & Services
Computer software for management via the internet of voice-based computer systems; computer software for interfacing with computer systems utilizing voice applications
Methods and apparatuses to assist a user in the performance of a plurality of tasks are provided. The invention includes storing at least one care plan for a resident, the care plan defining a plurality of tasks to be performed for providing care to the resident. Capturing speech inputs from the user, and providing speech outputs to the user to provide a speech dialog with the user reflective of the care plan. Information is captured with a contactless communication interface and is used for engaging the care plan.
G10L 21/06 - Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
G06F 19/00 - Digital computing or data processing equipment or methods, specially adapted for specific applications (specially adapted for specific functions G06F 17/00;data processing systems or methods specially adapted for administrative, commercial, financial, managerial, supervisory or forecasting purposes G06Q;healthcare informatics G16H)
74.
COOPERATIVE VOICE DIALOG AND BUSINESS LOGIC INTERPRETERS FOR A VOICE-ENABLED SOFTWARE APPLICATION
Embodiments of the invention address the deficiencies of the prior art by providing a method, apparatus, and program product to cooperatively mediate between voice-enabled operations and business logic. The method comprises receiving XML data and generating at least one object from the XML data. The method further comprises, in response to determining that the at least one object has been called, implementing an operation defined by a portion of the object.
A system for executing a multimodal software application 52 includes a mobile computer device 14 with a plurality of input interface components, the multimodal software application 52, and a dialog engine 62 in operative communication with the multimodal software application 52. The multimodal software application 52 is configured to receive first data from the plurality of input interface components. The dialog engine 62 executes a workflow description 60 from the multimodal software application 52 by providing prompts to an output interface component. Each of these prompts includes notification indicating which of the input interface components are valid receivers for that respective prompt. Furthermore, the notification may indicate the current prompt and at least the next prompt in sequence.
A system for executing a multimodal software application includes a mobile computer device with a plurality of input interface components, the multimodal software application, and a dialog engine in operative communication with the multimodal software application. The multimodal software application is configured to receive first data from the plurality of input interface components. The dialog engine executes a workflow description from the multimodal software application by providing prompts to an output interface component. Each of these prompts includes notification indicating which of the input interface components are valid receivers for that respective prompt. Furthermore, the notification may indicate the current prompt and at least the next prompt in sequence.
G06F 3/0481 - Interaction techniques based on graphical user interfaces [GUI] based on specific properties of the displayed interaction object or a metaphor-based environment, e.g. interaction with desktop elements like windows or icons, or assisted by a cursor's changing behaviour or appearance
G06F 9/06 - Arrangements for program control, e.g. control units using stored programs, i.e. using an internal store of processing equipment to receive or retain programs
Embodiments of the invention provide for a headset 12 and system including a portable device 10, wherein the headset 12 includes an electrically conductive band 30 configured to engage a portion of a head of a user 14 and coupling together a headpiece 32 and an earpiece assembly 34. The headpiece 32 includes an electrically conductive portion that is configured to engage and electrically couple to the user's head. The headpiece 32 is electrically coupled with the conductive band 30. The earpiece assembly 34 is coupled with the conductive band 30 to engage an ear of a user 14 and has a ground conductor 72 that is electrically coupled between the conductive band 30 and a portable device 10 for coupling the portable device 10 to the head of a user 14 for reducing electrostatic discharge at the head of the user 14 of the portable device 10.
An RFID reader (10) configured to be worn on the arm (16, 18) includes a housing (24) with a bottom surface (48, 58) and a top surface (42), A securement structure engages the arm (16, 18) of the user (16) to secure the housing (24). An antenna (26) is mounted within the housing (48) and is angled with respect to the bottom surface (48, 58) for providing an RF field (20) at an angle to the arm (16, 18) to direct the RF field (20) toward an item held by the arm (16, 18). The antenna (26) includes an indexing structure (76) with multiple angular positions for orientation of the antenna (26). A key structure (78) is configured for engaging the indexing structure (76) at an angular position to orient the antenna (26) at a desired rotational orientation to tune the antenna (26). Grip structures (56) are formed along the top surface (42) of the housing (24) for gripping an item (14) that contains an RFID tag (12).
G06K 7/10 - Methods or arrangements for sensing record carriers by electromagnetic radiation, e.g. optical sensingMethods or arrangements for sensing record carriers by corpuscular radiation
H01Q 1/27 - Adaptation for use in or on movable bodies
80.
Method and system for correctly identifying specific RFID tags
A system and method for identifying a specific RFID tag includes RFID reader circuitry, such as within an RFID reader, configured for sending and receiving RF signals to detect RFID tags and for obtaining signal parameter information associated with the RFID tags. Processing circuitry is configured for using the signal parameter information for one or more tags of the RFID tags and calculating a tag score for the one or more RFID tags. The processing circuitry is further configured for determining a specific RFID tag using the tag scores for the one or more RFID tags.
H04Q 5/22 - Selecting arrangements wherein two or more subscriber stations are connected by the same line to the exchange with indirect connection, i.e. through subordinate switching centre the subordinate centre not permitting interconnection of subscribers connected thereto
81.
METHOD AND SYSTEM FOR CORRECTLY IDENTIFYING SPECIFIC RFID TAGS
A system and method for identifying a specific RFID tag 40 includes RFID reader circuitry 32, such as within an RFID reader 24, configured for sending and receiving RF signals 38 to detect RFID tags 40 and for obtaining signal parameter information associated with the RFID tags 40. Processing circuitry 32 is configured for using the signal parameter information for one or more tags of the RFID tags 40 and calculating a tag score for the one or more RFID tags 40. The processing circuitry 32 is further configured for determining a specific RFI D tag 40 using the tag scores for the one or more RFID tags 40.
G06K 7/00 - Methods or arrangements for sensing record carriers
G01V 15/00 - Tags attached to, or associated with, an object, in order to enable detection of the object
G06Q 90/00 - Systems or methods specially adapted for administrative, commercial, financial, managerial or supervisory purposes, not involving significant data processing
A cord assembly (10) for an electronic device (12) includes a cord (18) having an insulation layer (44) with a first end (20, 52) and an exterior surface (46), as well as at least one electrical conductor (48) extending within the insulation layer (44). A strength member (50) also extends within the insulation layer (44) and includes a first end (20, 52) extending beyond the first end (20, 52) of the insulation layer (44). A securing element (56, 86) couples the first end (20, 52) of the strength member (50) with the first end (20, 52) of the insulation layer (44) along the exterior surface (46). The strength member (50) is pre-tensioned to carry any additional tensile loads on the cord (18), at least initially. The cord assembly (10) may also include a strain relief member (42,72) molded over the first end (20, 52) of the strength member and the securing element (56, 86).
A method of authenticating the interoperability of a headset 12 and a device 10, as well as a headset 12 and a device 10, is provided. The method includes, in a headset 12, monitoring at least one input for a verification signal, and, in response to failing to detect the verification signal within a predetermined period of time, selectively disabling a speaker 22 and/or microphone 24 of the headset 12. An alternative method includes, in a device 10, detecting a coupling of a headset 12 to the device 10, transmitting a verification signal to the headset 12, and receiving, from the headset 12, at least one of a serial number associated with the headset 12, an identification of a user 14 of the headset 12, a security certificate, or a voice translation template associated with the user 14.
Embodiments of the invention address the deficiencies of the prior art by providing a method, apparatus, and program product to of converting components of a web page to voice prompts for a user. In some embodiments, the method comprises selectively determining at least one HTML component from a plurality of HTML components of a web page to transform into a voice prompt for a mobile system based upon a voice attribute file associated with the web page. The method further comprises transforming the at least one HTML component into parameterized data suitable for use by the mobile system based upon at least a portion of the voice attribute file associated with the at least one HTML component and transmitting the parameterized data to the mobile system.
G10L 13/00 - Speech synthesisText to speech systems
G10L 13/08 - Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
G06F 17/30 - Information retrieval; Database structures therefor
G10L 15/22 - Procedures used during a speech recognition process, e.g. man-machine dialog
A method of authenticating the interoperability of a headset and a device, as well as a headset and a device, is provided. The method includes, in a headset, monitoring at least one input for a verification signal, and, in response to failing to detect the verification signal within a predetermined period of time, selectively disabling a speaker and/or microphone of the headset. An alternative method includes, in a device, detecting a coupling of a headset to the device, transmitting a verification signal to the headset, and receiving, from the headset, at least one of a serial number associated with the headset, an identification of a user of the headset, a security certificate, or a voice translation template associated with the user.
Embodiments of the invention address the deficiencies of the prior art by providing a method, apparatus, and program product to of converting components of a web page to voice prompts for a user. In some embodiments, the method comprises selectively determining at least one HTML component from a plurality of HTML components of a web page to transform into a voice prompt for a mobile system (16) based upon a voice attribute file associated with the web page. The method further comprises transforming the at least one HTML component into parameterized data suitable for use by the mobile system (16) based upon at least a portion of the voice attribute file (48) associated with the at least one HTML component and transmitting the parameterized data to the mobile system (16).
A speech recognition system receives and analyzes speech input from a user in order to recognize and accept a response from the user. Under certain conditions, information about the response expected from the user may be available. In these situations, the available information about the expected response is used to modify the behavior of the speech recognition system by taking this information into account. The modified behavior of the speech recognition system comprises adjusting the rejection threshold when speech input matches the predetermined expected response.
An electrical connector 12 for electrically connecting a terminal to a cord 18 of a peripheral device 16, the electrical connector 12 including a plug member 20, a lever arm 40 pivotally attached to the plug member 20, and a biasing member 44. The plug member 20 and lever arm 40 each include an engagement claw 36, 38 adapted to engage complementary surfaces on the terminal, and the biasing member 44 imparts a biasing force on the lever arm 40 to force the engagement claws 36, 38 to close onto the complementary surfaces on the terminal. The lever arm 40 advantageously includes a first gripping surface 52 with a contoured concave profile 48 for a user to apply a releasing force against the biasing force to remove the plug member 20 from the terminal. A second surface 72 includes a concave surface 74, 76 for engaging a user's finger and forming a grip bump 77.
An electrical connector for electrically connecting a terminal to a cord of a peripheral device, the electrical connector including a plug member, a lever arm pivotally attached to the plug member, and a biasing member. The plug member and lever arm each include an engagement claw adapted to engage complementary surfaces on the terminal, and the biasing member imparts a biasing force on the lever arm to force the engagement claws to close onto the complementary surfaces on the terminal. The lever arm advantageously includes a first gripping surface with a contoured concave profile for a user to apply a releasing force against the biasing force to remove the plug member from the terminal. A second surface includes a concave surface for engaging a user's finger and forming a grip bump.
A cord assembly for an electronic device includes a cord having an insulation layer with a first end and an exterior surface, as well as at least one electrical conductor extending within the insulation layer. A strength member also extends within the insulation layer and includes a first end extending beyond the first end of the insulation layer. A securing element couples the first end of the strength member with the first end of the insulation layer along the exterior surface. The strength member is pre-tensioned to carry any additional tensile loads on the cord, at least initially. The cord assembly may also include a strain relief member molded over the first end of the strength member and the securing element.
A system and method of assisting a care provider in the documentation of self-performance and support information for a resident or person includes a speech dialog with a care provider that uses the generation of speech to play to the care provider and the capture of speech spoken by a care provider. The speech dialog provides assistance to the care provider in providing care for a person according to a care plan for the person. The care plan includes one or more activities requiring a level of performance by the person. For the activity, speech inquiries are provided to the care provider, through the speech dialog, regarding performance of the activity by the person and regarding care provider assistance in the performance of the activity by the person. Speech input is captured from the care provider that is responsive to the speech inquiries. A code is then determined from the speech input and the code indicates the self-performance of the person and support information for a care provider for the activity.
G10L 21/00 - Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
A method for sending messages in a voice-enabled system and a voice- enabled system to communicate a message are provided. The method comprises generating a message with a message generating device 102, 106, analyzing the message to determine a voice-enabled device 106, 108 to send the message, and determining whether the voice-enabled device 106, 108 is available to receive the message. The method further comprises sending the message to the voice-enabled device 106, 108 in response to determining that the voice-enabled device 106, 108 is available to receive the message and, in response to determining that the voice- enabled device 106, 108 is not available, escalating the message based on an escalation protocol.
A method for model adaptation for a speech recognition system includes determining an error rate, corresponding to either recognition of instances of a word or recognition of instances of various words, without using a transcript of words input to the system. The method may further include adjusting an adaptation, of the model for the word or various models for the various words, based on the error rate. Apparatus are disclosed for identifying possible errors made by a speech recognition system without using a transcript of words input to the system. An apparatus for model adaptation for a speech recognition system includes a processor adapted to estimate an error rate, corresponding to either recognition of instances of a word or recognition of instances of various words, without using a transcript of words input to the system. The apparatus may further include a controller adapted to adjust an adaptation of the model for the word or various models for the various words, based on the error rate.
A method for model adaptation for a speech recognition system includes determining an error rate, corresponding to either recognition of instances of a word or recognition of instances of various words, without using a transcript of words input to the system. The method may further include adjusting an adaptation, of the model for the word or various models for the various words, based on the error rate. Apparatus are disclosed for identifying possible errors made by a speech recognition system without using a transcript of words input to the system. An apparatus for model adaptation for a speech recognition system includes a processor adapted to estimate an error rate, corresponding to either recognition of instances of a word or recognition of instances of various words, without using a transcript of words input to the system. The apparatus may further include a controller adapted to adjust an adaptation of the model for the word or various models for the various words, based on the error rate.
A speech-directed system 50 for doing tasks utilizing human speech includes a headset 12 including a microphone 16 for capturing user speech from a user 10 and a speaker 14 for playing audio to a user 10. A speech recognition component is resident on the headset 12 and operable for converting the user speech to data in a data format. A WPAN radio component 44 is resident on the headset 12 and is configured for converting the user speech data from the data format into a protocol format. A host device 18, 20, 22, 24, 30, 52, 62, 102, 104 is configured with a VVPAN radio component 46 for transceiving user speech data with the headset 12 in the protocol format. A long range wireless network component that is resident on the host device 24 couples with at least one remote device 32 through a long range wireless network 30. The host device 24 is operable for transceiving the user speech data with the remote device 32.
A headset includes an element to be held to the head of a wearer and a headband coupled to the element and configured for engaging a head of a wearer to hold the element thereon. The headband includes a plurality of headband arms coupled to pivot with respect to each other. A torsion spring is positioned between the arms and portions of the torsion spring are coupled to the arms for acting on the arms with a torsion force. An adjustment member is coupled between the torsion spring and a headband arm and is operable for adjusting the torsion force of the torsion spring to adjust the torsion force on the arms.
A headset includes an element to be held to the head of a wearer and a headband 10 coupled to the element and configured for engaging a head of a wearer to hold the element thereon. The headband 10 includes a plurality of headband arms 12, 14 coupled to pivot with respect to each other. A torsion spring 30 is positioned between the arms and portions of the torsion spring 30 are coupled to the arms 12, 14 for acting on the arms 12, 14 with a torsion force. An adjustment member is coupled between the torsion spring 30 and a headband arm 12, 14 and is operable for adjusting the torsion force of the torsion spring 30 to adjust the torsion force on the arms 12, 14.
A speech-directed user interface system includes at least one speaker 16, 50 for delivering an audio signal to a user 10, 60 and at least one microphone 18, 52 for capturing speech utterances of a user 10, 60. An interface device 30 interfaces with the speaker 16, 50 and microphone 18, 52 and provides a plurality of audio signals to the speaker 16, 50 to be heard by the user 10, 60. A control circuit is operably coupled with the interface device 30 and is configured for selecting at least one of the plurality of audio signals as a foreground audio signal for delivery to the user 10, 60 through the speaker 16, 50. The control circuit is operable for recognizing speech utterances of a user 10, 60 and using the recognized speech utterances to control the selection of the foreground audio signal.
09 - Scientific and electric apparatus and instruments
42 - Scientific, technological and industrial services, research and design
Goods & Services
Software for creating applications used in performing warehouse management functions. Design, development, consulting, and implementation services for software, hardware, and technology solutions for use with wearable wireless computer terminals.