A Workbench based on NiFi: Linked Open Data at your Fingertips
Keywords: Linked Data, AI , Apache NiFi, Artificial Intelligence, Big Data, Data Processing, RML, cloud computing, pipelines
Supervision: Ben De Meester Anastasia Dimou
Students: max 1
Data is messy. Trying to combine raw data in a meaningful way so that you and potentially the world could benefit from it is even messier. Many Big Data technologies have been created to on the one hand create scalable data processing pipelines, on the other hand create and publish high-quality Linked Data. Multiple Fortune 500 companies have been set-up to process Big Data, but the connection with Linked Data generation is still missing.
In this Master Thesis, you will study existing Big Data tech such as Apache NiFi, AirBnB Airbyte, Stitch, and Pentaho, and investigate the missing link between these technologies and our state-of-the-art Linked Data Generation and Publishing technologies such as RML.io. You will study the trade-offs, design, and investigate a method to join these two worlds. The resulting insights drive our technologies into highly-scalable software stacks, increase the developer experience for Linked Open Data generation, and maybe even get you a Fortune 500 company.