GithubHelp home page GithubHelp logo

crack-fettuccini / podcastalytics Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 4.29 MB

Data analytics tool that combines the data analytics capabilities of R with the code simplicity of python and speed of C to give an overview of the podcasting genre

R 14.11% C 78.76% Python 7.14%

podcastalytics's Introduction

PodCastalytics

Podcast consumer analysis analytics

PodCastalytics is a robust data analytics tool designed for new podcasters aiming to understand the current podcast scene. It leverages the data analysis capabilities of R, the code readability of Python, and the speed of C to provide insightful metrics.

Usage Instructions

  1. Run pods_reduced.R to initiate the analysis. Note that this process may take some time as each podcast recording is downloaded and individually analyzed for voice tone and other metrics.

  2. Run graphr.R to visualize the evaluated metrics.

Technologies Used

PodCastalytics uses a number of tools to run effectively

  • R: Utilized for visualizing evaluated metrics.
  • Python: Chosen for its code simplicity.
  • C: Enhances the evaluation speed.
  • TmpDisk: Facilitates large rewrites by hosting a TMPFS in-memory, removing strain on persistent memory.
  • Vokaturi: Simplifies sentiment and tone analysis of audio files.
  • ffmpeg: Assists in converting between audio formats.
  • librosa: Audio analytics library for collecting information about audio files.

Testing Methods and Spotify API

Additional testing methods are included in the Data_exploration folder.

To use the Spotify API, obtain API keys from Spotify Developer.

To customize the podcasts analysed, add or replace the list of pagelinks in pods_reduced.R or pods.R with RSS feeds of podcasts that you want to analyse.

File System Configuration

Set the .tempfolder to a TMPFS or RAMFS filesystem. Frequent rewrites of large data amounts can cause wear on persistent memory. Ensure the file isn't written to a hard drive/SSD to prevent wear.

Important Note:

Web Scraping Warning: Please respect the TOS of the websites you are scraping, especially when working with RSS feeds. Unauthorized or excessive scraping may lead to IP blocks or legal consequences. Just make sure your analysis is not affecting another users ability to listen to the podcast. We are all here because of our hobbies, so be careful to not take away someone else's fun.

Dataset Generation Time: The process of downloading podcast recordings and generating the dataset is a time-intensive task. Each podcast recording is individually analyzed for voice tone and other metrics, contributing to the extended duration. Even with a high speed internet, the download of each file may be rate-limited. It might take several days to completely process the data depending on the number, size and episodes of podcasts to be analysed.

podcastalytics's People

Contributors

crack-fettuccini avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.