Resources >‏ News >‏ News Details

News Details

QCRI in deal with UK’s Speechmatics to take Arabic transcription technology global

Publication Date:
16/05/2016
Category:
News
QCRI-Logo-new-2016.jpg

UK-based company Speechmatics is to use technology developed by the Qatar Computing Research Institute (QCRI), a division of Hamad bin Khalifa University, to take Arabic speech-to-text services to its global customers.

The Cambridge-based Speechmatics will use a QCRI product known as QATS (QCRI Advanced Transcription System) to transcribe Arabic broadcasts and audio files into text and subtitles.

QATS can transcribe modern standard Arabic as well as four major Arabic dialects: Egyptian, Levantine, North African and Gulf Arabic.

QCRI’s executive director Ahmed Elmagarmid said Qatar was leading global research in speech technology for Arabic.

“This is not just a technology transfer - it is much bigger. It will allow information sharing in Arabic around the world,” Dr Elmagarmid said.

Speechmatics’ chief scientific officer Tony Robinson said the development, which used the company’s recently-announcement Auto-Auto framework, would ensure Arabic-based content was “more discoverable and easily consumed”.  Dr Robinson was a pioneer in developing the application of deep learning in speech recognition in the 1980s and 1990s at the University of Cambridge.

“Speechmatics will help QCRI expand their reach to a broad range of industries and geographies with market leading speech-to-text services based on the latest research in machine learning and artificial intelligence,” Dr Robinson said.

QCRI Arabic Language Technologies principal engineer Ahmed Ali, who has been leading the speech team, said Deep Neural Network (DNN) and Recurrent Neural Network (RNN) architecture were used in QATS’ development.

“We used more than 2,000 hours of Arabic speech to develop and train QATS, in addition to a large archive of the Web 2.0 content,” Ali said.

Ali said the agreement would also give QCRI access to diverse data which would enable it to further hone its Arabic language technologies research.

An independent media monitoring company in May 2015 found QATS consistently outperformed its leading competitors in both standard and dialectal Arabic benchmark tests by at least 10 per cent.

It also won the “Best in Show” award at the third edition of the BBC’s #NewsHACK event in December 2014 for translating BBC Arabic videos into English, including subtitles, and voiceover using speech synthesis.

Al Jazeera Media Network has been using versions of QATS to transcribe its daily Arabic news reports for almost two years. Until now, more than 3,000 hours have been transcribed using the product.

Arabic is the world’s fourth most popular language and is spoken in at least 60 countries.

 

 


Follow Us

  • YouTube
  • Twitter
  • Facebook
  • RSS Feed
  • Linkedin
  • github-web.png
Back to Top

In the Media

MIT Tamer.JPG

Taming data

22/01/2017

The age of big data has seen a host of new techniques for analyzing large data sets. But before any of those techniques can be applied, the target data has to be aggregated, organized, and cleaned up...

Read More

BBC deep learning story pic.PNG

What is 'deep learning'?

08/01/2017

Every day we create billions of bits of data. Ever faster and more powerful computers can use that big data to learn, predict events and carry out key tasks. Surveillance, voice recognition and ...

Read More

EN Facebook story.jpg

Así influye Facebook en tus opiniones

08/12/2016

Hay varios presuntos culpables de la victoria de Donald Trump: la falta de visión de la campaña de Clinton, la globalización, la condescendencia de las elites. Facebook y su modo de seleccionar ...

Read More

Events

2017

ArabWic for web.jpg

Women in Data Science

Download ICS File 03/02/2017 ,

Here's a great chance to learn about the latest data science-related research in multiple domains, as part of a global project. Qatar's WiDS event will be held here at the HBKU Research Complex on ...

Read More

MLDAS 2017

(MLDAS 2017 )Machine Learning and Data Analytics Symposium

Download ICS File 13/03/2017  - 14/03/2017 , Qatar National Convention Center

Machine Learning and Data Analytics Symposium - MLDAS 2017 Building on the success of MLDAS 2016 and MLDAS 2015 , The Third Machine Learning and Data Analytics (MLDAS) Symposium , will be held on ...

Read More

Past Events

2016

QCRI IBM New.JPG

QCRI - IBM Data Science Connect 2016

Download ICS File 16/11/2016 ,

QCRI–IBM Data Science Connect 2016  Doha, Qatar 12.30pm –5:30pm, Wednesday, November 16 HBKU Research Complex, Ground Level Multi-Purpose Room Google Map link to location https://goo.gl/maps/...

Read More

News

Jalees10.jpg

QCRI’s Jalees Reader app launched in more languages

06/12/2016

French and German interfaces added for free app which allows users to upload books and read them offline.

Read More

IBM Watson robot (ex IBM Watson).JPG

IBM Watson scientist visits Qatar to present platform that 'thinks like a human'

16/11/2016

IBM Watson’s chief data scientist Romeo Kienzler has visited the Qatar Computing Research Institute to conduct a workshop on Watson, a question-answering platform that can “think like a human”. Mr ...

Read More

Stephan.JPG

QCRI’s BrailleEasy app launched on iOS store

31/08/2016

BrailleEasy, a custom one-handed Braille keyboard developed by the Qatar Computing Research Institute, is now available on the iOS App Store. The Braille keyboard was developed by Barbara Šepič, ...

Read More