DataCloud
DataCloud
ENABLING THE BIG DATA PIPELINE LIFECYCLE ON THE COMPUTING CONTINUUM
Full project details (EU Research results portal): https://cordis.europa.eu/project/id/101016835
Project description:
DataCloud provides a novel paradigm covering the complete lifecycle of managing Big Data pipelines through discovery, design, simulation, provisioning, deployment, and adaptation across the Computing Continuum. Big Data pipelines in DataCloud interconnect the end-to-end industrial operations of collecting pre-processing and filtering data, transforming and delivering insights, training simulation models, and applying them in the cloud to achieve a business goal. DataCloud delivers a toolbox of new languages, methods, infrastructures, and prototypes for discovering, simulating, deploying, and adapting Big Data pipelines on heterogeneous and untrusted resources. DataCloud separates the design from the run-time aspects of Big Data pipeline deployment, empowering domain experts to take an active part in their definitions. The main exploitation targets the operation and monetization of the toolbox in European markets, and in the Spanish-speaking countries of Latin America. Its aim is to lower the technological entry barriers for the incorporation of Big Data pipelines in organizations’ business processes and make them accessible to a wider set of stakeholders regardless of the hardware infrastructure. DataCloud validates its plan through a strong selection of complementary business cases offered by SMEs and a large company targeting higher mobile business revenues in smart marketing campaigns, reduced production costs of sport events, trustworthy eHealth patient data management, and reduced time to production and better analytics in Industry 4.0 manufacturing. The balanced consortium consists of 11 partners from eight countries. It has three strong university partners specialised in Big Data, distributed computing, and high-productivity languages, led by a research institute. DataCloud gathers six SMEs and one large company (as technology providers and stakeholders/users/early adopters) that prioritise the business focus of the project in achieving high business impacts.
EuroVoc IDs: /medical and health sciences/health sciences/health care services/eHealth
EU Programme: Horizon Europe
EU Project
Project publications:
| EU Project | Has Title | Has Category | Has Type | Has Year | Has DOI |
|---|---|---|---|---|---|
| DataCloud | Matching-based Scheduling of Asynchronous Data Processing Workflows on the Computing Continuum | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1109/cluster51413.2022.00021 |
| DataCloud | SimLess: Simulate Serverless Workflows and Their Twins and Siblings in Federated FaaS | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1145/3542929.3563478 |
| DataCloud | ExeKGLib: Knowledge Graphs-Empowered Machine Learning Analytics | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.48550/arxiv.2305.02966 |
| DataCloud | Comparison of Microservice Call Rate Predictions for Replication in the Cloud | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1145/3603166.3632566 |
| DataCloud | Towards Graph-based Cloud Cost Modelling and Optimisation | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1109/compsac57700.2023.00203 |
| DataCloud | VE-Match: Video Encoding Matching-based Model for Cloud and Edge Computing Instances | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1145/3593908.3593943 |
| DataCloud | A Human-in-the-Loop Approach to Support the Segments Compliance Analysis | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1007/978-3-031-16168-1 13 |
| DataCloud | ContrastNER: Contrastive-based Prompt Tuning for Few-shot NER | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1109/compsac57700.2023.00038 |
| DataCloud | Aligning Data-Aware Declarative Process Models and Event Logs | Data Science, Analytics, and Data Processing | Conference proceedings | 2021 | https://doi.org/10.1007/978-3-030-85469-0 16 |
| DataCloud | MPEC2: Multilayer and Pipeline Video Encoding on the Computing Continuum | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1109/nca57778.2022.10013519 |
| DataCloud | Supplier Optimization at Bosch with Knowledge Graphs and Answer Set Programming | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1007/978-3-031-43458-7 38 |
| DataCloud | Machine Learning Based Resource Utilization Prediction in the Computing Continuum | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.5281/zenodo.10203854 |
| DataCloud | ExeKG: Executable Knowledge Graph System for User-friendly Data Analytics | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1145/3511808.3557195 |
| DataCloud | Addressing the Scalability Bottleneck of Semantic Technologies at Bosch | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1007/978-3-031-43458-7 33 |
| DataCloud | An SQL-Based Declarative Process Mining Framework for Analyzing Process Data Stored in Relational Databases | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1007/978-3-031-41623-1 13 |
| DataCloud | CNN-assisted Road Sign Inspection on the Computing Continuum | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1109/ucc56403.2022.00038 |
| DataCloud | Proactive SLA-aware Application Placement in the Computing Continuum | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.1109/ipdps54959.2023.00054 |
| DataCloud | Scaling Data Science Solutions with Semantics and Machine Learning: Bosch Case | Data Science, Analytics, and Data Processing | Conference proceedings | 2023 | https://doi.org/10.48550/arxiv.2308.01094 |
| DataCloud | SmartRPA: A Tool to Reactively Synthesize Software Robots from User Interface Logs. | Data Science, Analytics, and Data Processing | Conference proceedings | 2021 | https://doi.org/10.1007/978-3-030-79108-7 16 |
| DataCloud | Preemptive online scheduling in the Computing Continuum | Data Science, Analytics, and Data Processing | Conference proceedings | 2022 | https://doi.org/10.1109/ucc56403.2022.00057 |
| DataCloud | Discovering Declarative Process Model Behavior from Event Logs via Model Learning | Data Science, Analytics, and Data Processing | Conference proceedings | 2021 | https://doi.org/10.1109/icpm53251.2021.9576870 |