QCRI launches RHEEM big data app
The Qatar Computing Research Institute (QCRI), a division of Hamad bin Khalifa University, has launched an application that can draw insights from big data across many software platforms.
The application, RHEEM, named after a native gazelle in Qatar, is the first general purpose system capable of what scientists call cross-platform data processing to be launched.
QCRI scientist Jorge-Arnulfo Quiané-Ruiz, who led the development of the software, said as the use and analysis of big data spread, organisations were using different specialised big data platforms to address specific needs.
“For example, social media and customer purchase analytics typically require separate software platforms,” Quiané-Ruiz said.
An airline could use RHEEM to mine data from a passenger database to target promotions to customers based on flight history data from information stored in a passenger list. RHEEM could then help the airline’s engineers to use this information to schedule extra flight capacity with data stored separately on multiple platforms. Qatar Airways has agreed to test the technology.
Hospitals could use the application to improve patient monitoring. For example, doctors might need to gather information about patients (whose information is typically stored in a database) who have received similar prescriptions, information usually stored in text files.
The oil industry could also use RHEEM to analyse structured and unstructured data to improve efficiency. During an exploration phase, for instance, data has to be acquired, integrated and analysed in order to predict if a reservoir will be profitable.
Quiané-Ruiz said RHEEM operated like a platform integrator or coordinator that used a set of rules to decide how to map a request from the application layer all the way down to the right platform. It works in a layered fashion, with the application sitting on the top layer and the bottom layer connecting with a myriad of popular platforms.
“Rheem can decide, for example that for certain tasks PostgreSQL is the right option but for others, distributed platforms like Hadoop or Spark could be more efficient,” Quiané-Ruiz said.
“The key challenge was to come up with a small set of rules that are not too complicated but at the same time could manage the diversity of data analysis tasks.”
RHEEM allows users to easily specify these tasks with easy-to-use interfaces, provides developers with opportunities to optimise performance in different ways and can run on any data-processing platform, such as PostgreSQL, Spark or GraphChi.
The software has been presented to scientists at leading international conferences including SIGMOD, VLDB and EDBT.
In the Media
This year CSAIL celebrates five years of collaboration with the Qatar Computing Research Institute (QCRI), an esteemed research institute that’s part of Hamad Bin Khalifa University in Doha. This ...
A new study has found what many of us have always thought to be true: We are more likely to accept correction from people we know than strangers. The study , conducted by researchers at Cornell, ...
There are few things social media users love more than flooding their feeds with photos of food. Yet we seldom use these images for much more than a quick scroll on our cellphones. Researchers from ...
Children and teenagers have been given a rare chance to develop their computing skills with world-class computing scientists at the first summer computing camp conducted by the Qatar Computing ...
The Qatar Computing Research Institute’s new Creative Space, which conducts fun activities to teach children computing skills, has successfully held its first Open House event. About 100 children ...
The QCRI – MIT CSAIL Annual Research Project Review is open to the public on Monday, March 27, 2017, at the HBKU Research Complex Multipurpose Room. The annual meeting is a highlight of a ...
Joint research undertaken by Dr. Ingmar Weber of Qatar Computing Research Institute, part of Hamad Bin Khalifa University, along with scientists from Oxford and Princeton universities, has won a ...