
pablomm / tfg


Bachelor's thesis: Functional data analysis: interpolation, registration, and nearest neighbors in scikit-fda

Home Page: https://github.com/GAA-UAM/scikit-fda

License: Other

Languages: Python 30.18%, TeX 69.82%
Topics: bachelor-thesis, functional-data-analysis, scikit-fda, universidad-autonoma-de-madrid, trabajo-fin-grado


Functional data analysis: interpolation, registration, and nearest neighbors in scikit-fda

License: CC BY-NC-SA 4.0

Bachelor's thesis to obtain a double degree in Computer Science and Mathematics at the Autonomous University of Madrid. This repository contains the files used to generate the undergraduate thesis document. The contributions made in this work can be found in the scikit-fda project repository.

Abstract

Functional Data Analysis (FDA) is a branch of Statistics devoted to the study of random quantities that depend on a continuous parameter, such as time series or curves in space. In FDA the data instances can be viewed as random functions sampled from an underlying stochastic process.

In this work we consider three different tasks in FDA: the use of interpolation techniques to estimate the values of the functions at unobserved points, the registration of this type of data, and the solution of classification and regression problems in which the instances are characterized by functional attributes. In particular, the scikit-fda package for FDA in Python has been extended with functionality in these areas.

Generally, the data instances considered in FDA consist of a collection of observations at discrete values of the parameter on which they depend (e.g. time or space). For some applications it is convenient, and in some cases necessary, to estimate the value of these functions at unobserved points. This can be achieved by interpolating from the available measurements.
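As an illustration of estimating a function between its observations, the following minimal sketch uses piecewise-linear interpolation in plain Python. It is not the scikit-fda implementation, which is based on splines. The function name `interpolate_linear` and the sample data are assumptions made for this example only.

```python
from bisect import bisect_right


def interpolate_linear(grid, values, t):
    """Estimate a sampled function at t by piecewise-linear interpolation.

    `grid` holds the increasing observation points and `values` the
    observed function values. Both names are hypothetical, chosen for
    this sketch.
    """
    if len(grid) != len(values) or len(grid) < 2:
        raise ValueError("grid and values need the same length (at least 2)")
    if not grid[0] <= t <= grid[-1]:
        raise ValueError("t lies outside the observed domain")

    # Index of the right endpoint of the interval that contains t.
    i = min(bisect_right(grid, t), len(grid) - 1)
    t0, t1 = grid[i - 1], grid[i]

    # Weight of the right endpoint: 0 at t0, 1 at t1.
    w = (t - t0) / (t1 - t0)
    return (1 - w) * values[i - 1] + w * values[i]


# One functional observation measured at four points (illustrative data).
grid = [0.0, 1.0, 2.0, 3.0]
values = [0.0, 1.0, 4.0, 9.0]

# Estimate the function at an unobserved point between two measurements.
estimate = interpolate_linear(grid, values, 1.5)
print(estimate)
```

A spline interpolant replaces the straight segments with smooth polynomial pieces, but the idea is the same: the estimate at an unobserved point is built from the neighbouring measurements.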

In some applications, the functions observed have similar shapes, but exhibit variability whose origin can be traced to distortions in the scale of the continuous parameter on which the data depend. Registration consists in characterizing this variability and eliminating it from the sample considered.
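The simplest form of registration, alignment by shifts, can be sketched as follows. This is a plain-Python illustration under stated assumptions, not the scikit-fda API: curves are represented as callables that can be evaluated slightly outside the grid, the shift is found by a grid search over candidate values, and the names `squared_l2` and `best_shift` are hypothetical.

```python
import math


def squared_l2(f, g, grid):
    """Trapezoidal approximation of the integral of (f - g)^2 over the grid."""
    total = 0.0
    for a, b in zip(grid, grid[1:]):
        left = (f(a) - g(a)) ** 2
        right = (f(b) - g(b)) ** 2
        total += 0.5 * (left + right) * (b - a)
    return total


def best_shift(curve, template, grid, candidates):
    """Return the candidate shift d that brings curve(t + d) closest to template(t)."""
    def cost(d):
        return squared_l2(lambda t: curve(t + d), template, grid)

    return min(candidates, key=cost)


# Evaluation grid on [0, 1] (illustrative).
grid = [i / 50 for i in range(51)]


def template(t):
    """Reference curve."""
    return math.sin(2 * math.pi * t)


def curve(t):
    """Same shape as the template, delayed by 0.1 in the continuous parameter."""
    return math.sin(2 * math.pi * (t - 0.1))


# Candidate shifts from -0.20 to 0.20 in steps of 0.01.
candidates = [k / 100 for k in range(-20, 21)]
shift = best_shift(curve, template, grid, candidates)


def registered(t):
    """The curve after the estimated shift has been removed."""
    return curve(t + shift)


print(shift)
```

Landmark and elastic registration generalize this idea by replacing the constant shift with a more flexible warping of the continuous parameter.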

In this work we also address classification and regression problems with data that are characterized by functions. Specifically, we design nearest neighbors estimators based on the notion of closeness among samples.
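A nearest neighbors classifier for functional data can be sketched in a few lines once a distance between curves is fixed. The example below is a minimal plain-Python illustration, not the scikit-fda estimators: it assumes all curves are sampled on a common grid, uses the L2 distance approximated by the trapezoidal rule, and the names `l2_distance`, `knn_predict`, and `scaled` are hypothetical.

```python
import math
from collections import Counter


def l2_distance(x, y, grid):
    """L2 distance between two curves sampled on the same grid (trapezoidal rule)."""
    total = 0.0
    for i in range(len(grid) - 1):
        left = (x[i] - y[i]) ** 2
        right = (x[i + 1] - y[i + 1]) ** 2
        total += 0.5 * (left + right) * (grid[i + 1] - grid[i])
    return math.sqrt(total)


def knn_predict(train_curves, train_labels, query, grid, k=3):
    """Predict the label of `query` by majority vote among its k closest curves."""
    ranked = sorted(
        range(len(train_curves)),
        key=lambda i: l2_distance(train_curves[i], query, grid),
    )
    votes = Counter(train_labels[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


# Common sampling grid on [0, 1] (illustrative).
grid = [i / 20 for i in range(21)]


def scaled(func, amplitude):
    """Sample amplitude * func(pi * t) on the grid."""
    return [amplitude * func(math.pi * t) for t in grid]


# Two classes of curves that differ in shape, with varying amplitude.
train_curves = [scaled(math.sin, a) for a in (0.9, 1.0, 1.1)]
train_curves += [scaled(math.cos, a) for a in (0.9, 1.0, 1.1)]
train_labels = ["sine"] * 3 + ["cosine"] * 3

query = scaled(math.sin, 1.05)
prediction = knn_predict(train_curves, train_labels, query, grid, k=3)
print(prediction)
```

For regression, the majority vote would be replaced by an average of the neighbours' responses, which may be scalars or functions.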

In summary, the scikit-fda package has been extended to include interpolation methods based on splines. The package has also been endowed with tools for data registration using shifts, landmark alignment, or elastic registration, which makes use of the Fisher-Rao metric to align the functions in a sample. In addition, models based on nearest neighbors have been included to carry out classification and regression, the latter with both scalar and functional response.
