Scaling Data Science Solutions with Semantics and Machine Learning: Bosch Case

08/02/2023
by Baifan Zhou, et al.

Industry 4.0 and Internet of Things (IoT) technologies unlock unprecedented amounts of data from factory production, posing big data challenges in volume and variety. In that context, distributed computing solutions such as cloud systems are leveraged to parallelise data processing and reduce computation time. As cloud systems become increasingly popular, there is growing demand for users who were not originally cloud experts (such as data scientists and domain experts) to deploy their solutions on cloud systems. However, it is non-trivial to address both the high demand for cloud system users and the excessive time required to train them. To this end, we propose SemCloud, a semantics-enhanced cloud system that couples cloud systems with semantic technologies and machine learning. SemCloud relies on domain ontologies and mappings for data integration, and parallelises semantic data integration and data analysis on distributed computing nodes. Furthermore, SemCloud adopts adaptive Datalog rules and machine learning for automated resource configuration, allowing non-cloud experts to use the cloud system. The system has been evaluated in an industrial use case with millions of data items, thousands of repeated runs, and domain users, showing promising results.

research
02/17/2018

Machine learning for Internet of Things data analysis: A survey

Rapid developments in hardware, software, and communication technologies...
research
09/01/2023

Co-Tuning of Cloud Infrastructure and Distributed Data Processing Platforms

Distributed Data Processing Platforms (e.g., Hadoop, Spark, and Flink) a...
research
05/13/2021

Models of Computing as a Service and IoT: an analysis of the current scenario with applications using LPWAN

This work provides the basis to understand and select Cloud Computing mo...
research
03/12/2019

Proceedings of the Fifth International Conference on Cloud and Robotics (ICCR2018)

The 5th edition of the International Conference on Cloud and Robotics (I...
research
08/25/2022

Overbook in Advance, Trade in Future: Computing Resource Provisioning in Hybrid Device-Edge-Cloud Networks

The big data processing in distributed Internet of Things (IoT) systems ...
research
12/07/2021

In-Network Processing for Low-Latency Industrial Anomaly Detection in Softwarized Networks

Modern manufacturers are currently undertaking the integration of novel ...
research
07/10/2023

Model-Driven Engineering Method to Support the Formalization of Machine Learning using SysML

Methods: This work introduces a method supporting the collaborative defi...
