aws-big-data-projects Goto Github PK
Name: Big Data Journal Projects
Type: Organization
Bio: This Projects are done under Cloud Tech and BigdataJournal Community Group
Twitter: thebigdatajour
Location: Vadodara
Name: Big Data Journal Projects
Type: Organization
Bio: This Projects are done under Cloud Tech and BigdataJournal Community Group
Twitter: thebigdatajour
Location: Vadodara
This repository provides Code examples written in Python,Spark-Scala using primarily boto3 SDK API methods and aws cli examples for majority of the AWS Big Data services. There are also nicley written Wiki articles for most of the common issues/challenges faced within BigData world.
A cross-platform (Windows, MAC, Linux) desktop application to view common bigdata binary format like Parquet, ORC, AVRO, etc. Support local file system, HDFS, AWS S3, Azure Blob Storage ,etc.
A complete example of a big data application using : Kubernetes (kops/aws), Apache Spark SQL/Streaming/MLib, Apache Flink, Scala, Python, Apache Kafka, Apache Hbase, Apache Parquet, Apache Avro, Apache Storm, Twitter Api, MongoDB, NodeJS, Angular, GraphQL
Player Unknown's Battlegrounds (PUBG), is a first person shooter game where the goal is to be the last player standing. You are placed on a giant circular map that shrinks as the game goes on, and you must find weapons, armor, and other supplies in order to kill other players / teams and survive.
Construct a modern data stack and orchestration the workflows to create high quality data for analytics and ML applications.
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
This repository hold the Amazon Elastic MapReduce sample bootstrap actions
Example code for running Spark and Hive jobs on EMR Serverless.
This repository contains ready-to-use notebook examples for a wide variety of use cases in Amazon EMR Studio.
Ferry lets you define, run, and deploy big data applications on AWS, OpenStack, and your local machine using Docker
A simple, practical, and affordable system for measuring head trauma within the sports environment, subject to the absence of trained medical personnel made using Amazon Kinesis Data Streams, Kinesis Data Analytics, Kinesis Data Firehose, and AWS Lambda
you run a script to mimic multiple sensors publishing messages on an IoT MQTT topic, with one message published every second. The events get sent to AWS IoT, where an IoT rule is configured. The IoT rule captures all messages and sends them to Firehose. From there, Firehose writes the messages in batches to objects stored in S3. In S3, you set up a table in Athena and use QuickSight to analyze the IoT data.
Hopsworks - Data-Intensive AI platform with a Feature Store
In this project, a framework is developed leveraging the capabilities of artificial neural networks to “caption an image based on its significant features”.
Iot,Big Data Analytics using Apache-kafka,spark and other aws services
Build a Visualization and Monitoring Dashboard for IoT Data with Amazon Kinesis Analytics and Amazon QuickSight
Collect, process, and analyze log data using Amazon Kinesis and Elasticsearch Service
AWS Retail Demo Store is a sample retail web application and workshop platform demonstrating how AWS infrastructure and services can be used to build compelling customer experiences for eCommerce, retail, and digital marketing use-cases
Run a Spark job within Amazon EMR
Python CDK serverless data pipeline with CI/CD process and Slack notifications.
Serverless Data Pipeline powered by Kinesis Firehose, API Gateway, Lambda, S3, and Athena
Simplify Big Data Analytics with Amazon EMR, published by Packt
Spark Examples
This project provides Apache Spark SQL, RDD, DataFrame and Dataset examples in Scala language
A comprehensive Spark guide collated from multiple sources that can be referred to learn more about Spark or as an interview refresher.
A solutions that automatically configures the AWS services necessary to easily capture, store, process, and deliver streaming data. This solution helps you solve for real-time streaming use cases like capturing high volume application logs, analyzing clickstream data, continuously delivering to a data lake, and more.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.