Earlier this month, at the 9th International Conference on Fog and Mobile Edge Computing (FMEC 2024), our very own AI engineer, Gustav Nilsson, presented his paper "The Role of the Data Quality on Model Efficiency: An Exploratory Study on Centralized and Federated Learning." Gustav wrote the paper in collaboration with Imagimob as part of his studies in AI and Machine Learning toward a Master of Science in Engineering at Blekinge Institute of Technology (BTH).
On how the paper came about, Gustav says: "The core idea of the paper was a result of discussions with people at Imagimob, and they supported me throughout the whole process!"
The full paper will be published shortly, so stay tuned: we will share the link on our blog and on our LinkedIn in the near future. For now, you can read the abstract.
This paper experimentally investigates the impact that datasets of varying quality have on centralized vs. federated learning models. We also investigate how the distribution of low-quality data across federated clients affects the models' accuracy. For the experiments, we create datasets of progressively lower quality with respect to two data quality metrics: data accuracy and data completeness. This is done by perturbing (i.e., modifying) the datasets so as to degrade these two metrics. Three experiments are then conducted that investigate: i) the impact of decreased data accuracy on the models' performance, ii) the impact of decreased data completeness, and iii) the effects of different distributions of low-quality data across the clients in the federated learning setup. The results reveal that the centralized model achieves 60.3% validation accuracy with low data accuracy and 58.7% with low data completeness, while the federated model performs better, achieving 69.3% validation accuracy with low data accuracy and 79.2% with low data completeness. The federated model is less affected by low data quality when that quality is distributed evenly between its clients; an uneven distribution of data quality between clients has a more negative impact. Overall, the federated learning setup displays certain attributes that make it more robust to low-quality data than centralized learning.
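The paper's exact perturbation procedure is not reproduced here, but the two degradations it describes, lowering data accuracy and lowering data completeness, plus the even split of data across federated clients, can be sketched roughly as follows. This is a minimal illustration under our own assumptions, not the paper's implementation; the function names, fractions, and even-split strategy are ours:

```python
import random

def reduce_accuracy(labels, fraction, num_classes, seed=0):
    """Lower data accuracy by flipping a fraction of labels to a wrong class."""
    rng = random.Random(seed)
    labels = list(labels)
    for i in rng.sample(range(len(labels)), int(fraction * len(labels))):
        wrong = [c for c in range(num_classes) if c != labels[i]]
        labels[i] = rng.choice(wrong)
    return labels

def reduce_completeness(samples, fraction, seed=0):
    """Lower data completeness by blanking a fraction of feature values."""
    rng = random.Random(seed)
    samples = [list(s) for s in samples]
    cells = [(i, j) for i, s in enumerate(samples) for j in range(len(s))]
    for i, j in rng.sample(cells, int(fraction * len(cells))):
        samples[i][j] = None  # missing value
    return samples

def split_even(dataset, num_clients):
    """Even distribution: each federated client gets an equal share,
    so perturbed samples are spread uniformly across clients."""
    return [dataset[i::num_clients] for i in range(num_clients)]
```

An uneven distribution, by contrast, would concentrate the perturbed samples on a subset of clients, which the abstract reports hurts the federated model more than an even spread does.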
Be sure to subscribe to our newsletter so you don't miss the paper in its entirety!