GithubHelp home page GithubHelp logo

smart-objects / impact-events-dataset Goto Github PK

View Code? Open in Web Editor NEW
13.0 2.0 0.0 108 KB

This repository contains a novel time-series dataset for impact detection and localization on a plastic thin-plate, towards Structural Health Monitoring applications, using ceramic piezoelectric transducers (PZTs) connected to an Internet of Things (IoT) device.

dataset shm arduino data-collection impact-analysis iot-data microcontroller piezoelectric-sensors pzt structural-health-monitoring

impact-events-dataset's Introduction

Impact-Events-Dataset

Full name - Dataset: Impact Events for Structural Health Monitoring of a Plastic Thin Plate

Status

  • Due to current git lfs limits, the dataset is also available and can be accessed from Zenodo.

  • Publications

    • The Impact Events data contained in this repository has been successfully accepted for publication as a dataset paper in the Fifth International Workshop on Data: Acquisition To Analysis (DATA '22).

    • In addition, the data are used in the paper "Smart Objects: Impact localization powered by TinyML" that is accepted for publication in the 4th International Workshop on Challenges in Artificial Intelligence and Machine Learning for Internet of Things (AIChallengeIoT 2022).

    • Both workshops are held in conjuction with the 20th ACM Conference on Embedded Networked Sensor Systems SenSys 2022

    • A preprint arxiv for the dataset paper has been announced (here).

Dataset outline

This repository contains a novel time-series dataset for impact detection and localization on a plastic thin-plate, towards Structural Health Monitoring applications, using ceramic piezoelectric transducers (PZTs) connected to an Internet of Things (IoT) device. The dataset was collected from an experimental procedure of low-velocity, low-energy impact events that includes at least 3 repetitions for each unique experiment, while the input measurements come from 4 PZT sensors placed at the corners of the plate. For each repetition and sensor, 5000 values are stored with 100 KHz sampling rate. The system is excited with a steel ball, and the height from which it is released varies from 10 cm to 20 cm.

Why - Motivation and Contribution

To the best of our knowledge, we are the first, to publish a public dataset that contains PZT sensors measurements concerning low-velocity, low-energy impact events in a thin plastic plate. In addition, we also contribute with our methodology on data collection using an SHM IoT system with resource constraints (based on Arduino NANO 33 MCU), as opposed to the majority of the literature that uses Oscilloscopes for data acquisition. This concept of an MCU-based system for data collection in SHM is especially important nowadays, due to the fast rise of extreme-edge and embedded machine learning practices solutions that enable a variety of real-time data-driven SHM applications. Finally, we wish to highlight that by using this specific Microcontroller Unit (MCU) and sensors, the proposed implementation aims for an overall low-cost data collection solution.

Dataset file structure

The directory of the Impact Events dataset is organized as follows:

  • DoE_Data folder contains one folder for each different height of the impact ball fall (ranging from 10.0 to 20.0 cm with an interval of 0.5 cm).

    • Then each of those folders contains one .csv file per experiment, based on the generated Sobol sequences (sobol_[Height]_[Sobol_ID].csv).
  • The merged_dataset.csv file contains the unified dataset from all the sensor measurements across all the experiments

  • Feature_extraction folder contains the features.json file that includes the index of the statistical features to be extracted (see the following section) and feature_augmented_dataset.csv that contains the extracted features (as columns) and the rest of impact information (x,y, height, position) per experiment (rows).

Feature extraction list

The complete list of extracted features is the following:

Feature Description
Energy Energy of the time series
Absolute Maximum Highest absolute value
Absolute Sum of Changes Sum of absolute values of consecutive changes
C3 Measurement of nonlinearity
Count Above Mean Number of values greater than the mean
Count Below Mean Number of values lesser than the mean
Energy Ratio by Chunks Ratio of the $i$-th chunk's energy over the energy of N chunks
First Location of Maximum Index of the maximum value's first appearance relative to the length of the time series
First Location of Maximum Index of the minimum value's first appearance relative to the length of the time series
Kurtosis Signal's tails measurement relative to normal distribution
Large Standard Deviation Check if the Standard Deviation is $r$ times greater than $max - min$
Longest Strike Above Mean Number of consecutive values above mean
Longest Strike Below Mean Number of consecutive values below mean
Maximum Highest Value
Mean Mean of the time series
Mean of Absolute Change Mean value of absolute change
Mean Change Average of differences of subsequent values
Median Median of the time series
Minimum Lowest Value
Skewness Calculates the lack of symmetry
Standard Deviation Measurement of time series's dispersion relative to the mean
Symmetry Looking Checks the symmetry of the time series
Variance Variability of the time series
Variance Larger than Standard Deviation Checks if the variance is greater than standard deviation
Quantile Calculates the q quantile of x
Ratio beyond r_sigma Ratio of values that are more than r * std(x) (so r times sigma) away from the mean of x
Sum of reoccuring data points Returns the sum of all data points, that are present in the time series more than once
Sum of reoccuring values Returns the sum of all values, that are present in the time series more than once
Ratio to time series length Returns a factor which is 1 if all values in the time series occur only once, and below one if this is not the case
Percentage of reoccuring datapoints Returns the percentage of non-unique data points
Percentage of reoccuring values Returns the percentage of values that are present in the time series more than once
Benford correlation Returns the correlation from first digit distribution when compared to the Newcomb-Benford’s Law distribution

How to cite

If you use the Impact Events dataset in your work, please make the corresponding citation.

Research papers concerning the dataset (recommended):

  • Conference paper: @inproceedings{dataset_sensys, title={Dataset: Impact events for Structural Health Monitoring of a thin plate}, author = {Katsidimas, Ioannis and Kotzakolios, Thanasis and Nikoletseas, Sotiris and Panagiotou, Stefanos H. and Timpilis, Konstantinos and Tsakonas, Constantinos}, booktitle={Proceedings of the 20th ACM Conference on Embedded Networked Sensor Systems}, year={2022}, doi = {https://doi.org/10.1145/3560905.3567764} }

  • Arxiv preprint: @misc{https://doi.org/10.48550/arxiv.2209.10018, doi = {10.48550/ARXIV.2209.10018}, url = {https://arxiv.org/abs/2209.10018}, author = {Katsidimas, Ioannis and Kotzakolios, Thanasis and Nikoletseas, Sotiris and Panagiotou, Stefanos H. and Timpilis, Konstantinos and Tsakonas, Constantinos}, keywords = {Machine Learning (cs.LG), FOS: Computer and information sciences, FOS: Computer and information sciences}, title = {Dataset: Impact Events for Structural Health Monitoring of a Plastic Thin Plate}, publisher = {arXiv}, year = {2022}, copyright = {Creative Commons Attribution Non Commercial Share Alike 4.0 International}}

Github repository citation:

  • APA: Katsidimas, I., Kotzakolios, T., Nikoletseas, S., Panagiotou, S. H., Timpilis, K., & Tsakonas, C. Impact Events for Structural Health Monitoring of a Plastic Thin Plate [Data set]

  • BibTeX: @misc{Katsidimas_Impact_Events_for, author = {Katsidimas, Ioannis and Kotzakolios, Thanasis and Nikoletseas, Sotiris and Panagiotou, Stefanos H. and Timpilis, Konstantinos and Tsakonas, Constantinos}, title = {{Impact Events for Structural Health Monitoring of a Plastic Thin Plate}}}

License

Creative Commons Attribution-NonCommercial-ShareAlike (CC-BY-NC-SA):

A creative commons license that bans commercial use and requires you to release any modified works under this license.

Authors

  • Ioannis Katsidimas
  • Thanasis Kotzakolios
  • Sotiris Nikoletseas
  • Stefanos H. Panagiotou
  • Konstantinos Timpilis
  • Constantinos Tsakonas

impact-events-dataset's People

Contributors

iamtsac avatar stefanosh avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

impact-events-dataset's Issues

Can't download the datasets

I want to try the datasets for impact identification. But downloading the dataset is quite troublesome.

I use git to access the dataset stored in lsf. But there is an error:

This repository is over its data quota. Account responsible for LFS bandwidth should purchase more data packs to restore access.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.