This is a collection of many frameworks and tools for software development.
Any recommendations and suggestions are welcome.
- Architecture
- Cloud Providers
- Big Data
- Database
- Data Warehouse
- Storage
- Distributed File System
- BI Tools
- Parallel Programming
- Frameworks & Tools
- DevOps
- Security
- Mobile Frameworks
- Game Engines
- AI Tools
- IoT
- UML Tools
- Summit Events
- Algorithms
- Microservices - Microservices is a specialisation of and implementation approach for distributed architectures used to build flexible, independently deployable software systems.
- Reactive Programming - Reactive programming is about non-blocking applications that are asynchronous and event-driven and require a small number of threads to scale.
- Lambda Architecture - Lambda architecture is a data-processing architecture designed to handle massive quantities of data by taking advantage of both batch- and stream-processing.
- CQRS - Command and Query Responsibility Segregation (CQRS) is a pattern that segregates the operations that read data (Queries) from the operations that update data (Commands) by using separate interfaces.
- Event Sourcing - Event Sourcing persists each business entity as a sequence of events.
- Materialized View - A materialized view is a database object that contains the results of a query.
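
The last three patterns above are often used together. As a rough illustration only, the sketch below keeps an append-only event log (Event Sourcing), routes writes through a command object and reads through a separate view object (CQRS), and keeps that view as a precomputed query result, which is similar in spirit to a materialized view. Every name here (`Event`, `EventStore`, `AccountCommands`, `BalanceView`, `replay_balance`) is hypothetical and not taken from any framework in this list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    account_id: str
    kind: str    # "deposited" or "withdrawn"
    amount: int


def replay_balance(events):
    """Rebuild the current state of one account by folding over its events."""
    balance = 0
    for event in events:
        balance += event.amount if event.kind == "deposited" else -event.amount
    return balance


class EventStore:
    """Append-only log of events; state is derived from it, never overwritten."""

    def __init__(self):
        self._events = []
        self._subscribers = []

    def subscribe(self, handler):
        self._subscribers.append(handler)

    def append(self, event):
        self._events.append(event)
        for handler in self._subscribers:
            handler(event)

    def events_for(self, account_id):
        return [e for e in self._events if e.account_id == account_id]


class AccountCommands:
    """Write side: validates a command, then records the resulting event."""

    def __init__(self, store):
        self._store = store

    def deposit(self, account_id, amount):
        if amount <= 0:
            raise ValueError("deposit must be positive")
        self._store.append(Event(account_id, "deposited", amount))

    def withdraw(self, account_id, amount):
        balance = replay_balance(self._store.events_for(account_id))
        if amount <= 0 or amount > balance:
            raise ValueError("invalid withdrawal")
        self._store.append(Event(account_id, "withdrawn", amount))


class BalanceView:
    """Read side: a precomputed result that is updated as events arrive."""

    def __init__(self):
        self._balances = {}

    def apply(self, event):
        delta = event.amount if event.kind == "deposited" else -event.amount
        self._balances[event.account_id] = self._balances.get(event.account_id, 0) + delta

    def balance(self, account_id):
        return self._balances.get(account_id, 0)


store = EventStore()
view = BalanceView()
store.subscribe(view.apply)
commands = AccountCommands(store)

commands.deposit("acc-1", 100)
commands.withdraw("acc-1", 30)
print(view.balance("acc-1"))  # 70
```

In a real system the event log and the view would usually live in separate stores and be updated asynchronously; the in-process subscriber here only keeps the example short.
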
- Amazon Web Services (AWS) - Amazon Web Services offers reliable, scalable, and inexpensive cloud computing services. Free to join, pay only for what you use.
- Microsoft Azure - Microsoft Azure is an open, flexible, enterprise-grade cloud computing platform.
- Google Cloud Platform - Google Cloud Platform lets you build and host applications and websites, store data, and analyze data on Google's scalable infrastructure.
- IBM Bluemix - Bluemix is an open standards, cloud platform for building, running, and managing apps and services.
- OpenStack - OpenStack software controls large pools of compute, storage, and networking resources throughout a datacenter.
- Apache CloudStack - Apache CloudStack is open source software designed to deploy and manage large networks of virtual machines.
- Cloud Foundry - Cloud Foundry is an open source cloud computing platform as a service (PaaS) originally developed by VMware and now overseen by the Cloud Foundry Foundation.
- DigitalOcean - DigitalOcean is a simple and robust cloud computing platform, designed for developers.
- Heroku - Heroku is a platform as a service (PaaS) that enables developers to build, run, and operate applications entirely in the cloud.
- SAP HANA Cloud Platform - SAP HANA Cloud Platform is an open platform-as-a-service that provides unique in-memory database and application services.
- Oracle Cloud - Oracle Cloud Platform as a Service (PaaS) helps enterprise IT and independent software vendor (ISV) developers rapidly build and deploy rich applications.
- Amazon EMR - Amazon EMR simplifies big data processing, providing a managed Hadoop framework.
- Cloudera CDH - CDH is Cloudera's software distribution containing Apache Hadoop and related projects. All components are 100% open source.
- Hortonworks HDP - Hortonworks' product named Hortonworks Data Platform (HDP) includes Apache Hadoop and is used for storing, processing, and analyzing large volumes of data.
- IBM BigInsights - IBM provides a complete solution of Hadoop, including Spark, to scale analytics quickly and easily. Available on-premises, on-cloud, and integrated with other systems in use today.
- MapR - The MapR Converged Data Platform is the industry's only platform to integrate the enormous power of Hadoop and Spark with global event streaming, real-time.
- Pivotal Big Data Suite - Pivotal Big Data Suite provides the flexibility to choose and adopt proven, open source, scale-out databases, including: Pivotal Greenplum, Pivotal HDB.
- Databricks - Founded by the creators of Apache Spark, Databricks makes big data analytics simple through an integrated workspace hosted as a service in the cloud.
- Concurrent - One of the distributors in the Hadoop world.
- Apache Kafka - Kafka™ is used for building real-time data pipelines and streaming apps.
- Apache Flume - Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data.
- RabbitMQ - RabbitMQ is open source message broker software that implements the Advanced Message Queuing Protocol (AMQP).
- Mosquitto - Mosquitto is an open source message broker that implements the MQTT (MQ Telemetry Transport) protocol v3.1.
- Logstash - Logstash is an open source, server-side data processing pipeline that ingests data from a multitude of sources simultaneously.
- Spring XD - Spring XD is a unified, distributed, and extensible service for data ingestion, real time analytics, batch processing, and data export.
- Confluent Platform - The free, open-source streaming platform based on Apache Kafka. Confluent Platform is the best way to get started with real-time data streams.
- Apache Storm - Apache Storm is a free and open source distributed realtime computation system.
- Apache Spark Streaming - Spark Streaming brings Apache Spark's language-integrated API to stream processing, letting you write streaming jobs the same way you write batch jobs.
- Apache Flume Interceptor - Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data.
- Apache Samza - Apache Samza is a distributed stream processing framework. It uses Apache Kafka for messaging, and Apache Hadoop YARN to provide fault tolerance.
- Apache Gearpump - Apache Gearpump is a real-time big data streaming engine.
- Spring XD - Spring XD is a unified, distributed, and extensible service for data ingestion, real time analytics, batch processing, and data export.
- Apache Spark - Apache Spark™ is a fast and general engine for large-scale data processing.
- Apache MapReduce - The Apache Hadoop MapReduce software library is a framework that allows for the distributed processing of large data sets across clusters of computers using simple programming models.
- Spring XD - Spring XD is a unified, distributed, and extensible service for data ingestion, real time analytics, batch processing, and data export.
- Apache Flink - Apache Flink® is an open source platform for distributed stream and batch data.
- Apache Mesos - Apache Mesos abstracts CPU, memory, storage, and other compute resources away from machines (physical or virtual), enabling fault-tolerant and elastic distributed systems to easily be built and run effectively.
- Mesosphere - Mesosphere Enterprise DC/OS is an enterprise grade datacenter-scale operating system, providing a single platform for running containers, big data.
- Apache YARN - Apache Hadoop YARN (Yet Another Resource Negotiator) is Hadoop's cluster resource management and job scheduling layer, on which processing frameworks such as MapReduce run.
- Apache Impala - Apache Impala is the open source, native analytic database for Apache Hadoop. Impala is shipped by Cloudera, MapR, Oracle, and Amazon.
- Apache Hive - Apache Hive is a data warehouse infrastructure built on top of Hadoop for providing data summarization, query, and analysis.
- Spark SQL - Spark SQL is a Spark module for structured data processing.
- Apache Drill - Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage.
- Presto - Presto is an open source distributed SQL query engine for running interactive analytic queries against data sources of all sizes.
- Hive on Apache Tez - The Apache Tez™ project is aimed at building an application framework which allows for a complex directed-acyclic-graph of tasks for processing data.
- Apache Phoenix - Apache Phoenix takes your SQL query, compiles it into a series of HBase scans.
- Apache HAWQ - Apache HAWQ (incubating) combines exceptional MPP-based analytics performance, robust ANSI SQL compliance, Hadoop ecosystem.
- IBM Big SQL - IBM Big SQL is a data warehouse system for Hadoop that you use to summarize, query, and analyze data.
- Apache Kylin - Apache Kylin™ is an open source Distributed Analytics Engine.
- Apache Mahout - The Apache Mahout™ project's goal is to build an environment for quickly creating scalable performant machine learning applications.
- Spark MLlib - MLlib is Apache Spark's scalable machine learning library.
- H2O - H2O is open-source software for big-data analysis.
- Apache MADlib - Apache MADlib (incubating): Big Data Machine Learning in SQL.
- Weka - Waikato Environment for Knowledge Analysis (Weka) is a popular suite of machine learning software written in Java, developed at the University of Waikato.
- Scikit Learn - An open source Python library that implements a range of machine learning, preprocessing, cross-validation and visualization algorithms.
- Apache SystemML - Apache SystemML provides an optimal workplace for Machine Learning using big data.
- PyData - PyData is a gathering of users and developers of data analysis tools in Python.
- Jupyter - Open source, interactive data science and scientific computing across over 40 programming languages.
- Zeppelin - The Apache Zeppelin interpreter concept allows any language or data-processing backend to be plugged into Zeppelin.
- RStudio - A powerful and productive user interface for R.
- DataStax Enterprise - DataStax powers the big data applications that transform business and profoundly improve customer experiences through Apache Cassandra™.
- Amazon DynamoDB - Amazon DynamoDB is a fully managed NoSQL database service that provides fast and predictable performance with seamless scalability.
- Cassandra - The Apache Cassandra database is the right choice when you need scalability and high availability without compromising performance.
- Apache HBase - Apache HBase™ is the Hadoop database, a distributed, scalable, big data store. Use Apache HBase™ when you need random, realtime read/write access.
- Apache Accumulo - Apache Accumulo™ is a sorted, distributed key/value store that provides robust, scalable data storage and retrieval.
- Riak KV - Riak® KV is a distributed NoSQL key-value database with advanced local and multi-cluster replication that guarantees reads and writes even in the event of hardware failures or network partitions.
- MongoDB - MongoDB for GIANT Ideas - Build innovative modern applications that create a competitive advantage.
- CouchDB - Apache CouchDB is open source database software that focuses on ease of use and having an architecture that "completely embraces the Web".
- Couchbase - Couchbase Server is a distributed NoSQL document database developed by Couchbase.
- DocumentDB - DocumentDB is a fully managed NoSQL database service built for fast and predictable performance, high availability, elastic scaling.
- Elasticsearch - Elasticsearch is a distributed, RESTful search and analytics engine capable of solving a growing number of use cases.
- Apache Solr - Solr is highly reliable, scalable and fault tolerant, providing distributed indexing, replication and load-balanced querying.
- Splunk - Splunk Inc. provides the leading platform for Operational Intelligence. Customers use Splunk to search, monitor, analyze and visualize machine data.
- Neo4j - Neo4j is a graph database management system developed by Neo Technology, Inc.
- Titan - Titan is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across clusters.
- GraphX - GraphX is Apache Spark's API for graphs and graph-parallel computation, with a built-in library of common algorithms.
- NuoDB - NuoDB is a database startup company based in Cambridge, Massachusetts. It sells a NewSQL database that works in the cloud.
- FoundationDB - FoundationDB was a multi-model NoSQL database with a shared nothing architecture.
- Oracle Database In-Memory - Oracle Database In-Memory delivers leading-edge in-memory performance without the need to restrict functionality or accept compromises, complexity and risk.
- DB2 BLU - BLU Acceleration is revolutionary in-memory technology that is designed for high-performance analytics and data-intensive reporting.
- IBM dashDB - IBM dashDB offers fully-managed, SQL database services.
- Hekaton - Hekaton is a new database engine optimized for memory resident data and OLTP workloads.
- Apache Kudu - Apache Kudu completes Hadoop's storage layer to enable fast analytics on fast data.
- Redis - Redis is an open source (BSD licensed), in-memory data structure store, used as database, cache and message broker.
- VoltDB - VoltDB is the world's fastest in-memory operational database - allowing you to ingest data, analyze data, and act on data in milliseconds with real-time experience.
- MemcacheDB - MemcacheDB is a distributed key-value storage system designed for persistence.
- Pivotal GemFire - Pivotal GemFire is an in-memory distributed data grid for high scale custom applications.
- Apache Geode - Apache Geode is a distributed, in-memory database with strong data consistency.
- H2 - H2 is a relational database management system written in Java. It can be embedded in Java applications or run in the client-server mode.
- Hazelcast - Hazelcast is the leading in-memory data grid solution.
- Ehcache - Ehcache is an open source, standards-based cache that boosts performance, offloads your database, and simplifies scalability.
- Infinispan - Infinispan is a distributed cache and key-value NoSQL data store software developed by Red Hat.
- GridGain - The GridGain Enterprise Edition includes valuable features added to Apache® Ignite™ which make deploying and maintaining a high performance in-memory.
- Apache Ignite - Apache Ignite™ In-Memory Data Fabric is a high-performance, integrated and distributed in-memory platform for computing and transacting.
- JCS - JCS is a distributed caching system written in Java.
- Event Store - The open-source, functional database with Complex Event Processing in JavaScript.
- RocksDB - RocksDB is an embeddable persistent key-value store for fast storage.
- LevelDB - LevelDB is a simple key/value data store built by Google, inspired by BigTable.
- SQLite - SQLite is a relational database management system contained in a C programming library.
- Berkeley DB - Berkeley DB is a family of embedded key-value database libraries providing scalable high-performance data management services to applications.
- JavaDB - Java DB is Oracle's supported distribution of the Apache Derby open source database.
- Apache Derby - Apache Derby, an Apache DB subproject, is an open source relational database implemented entirely in Java and available under the Apache License.
- Riak TS - Riak® TS is the only enterprise-grade NoSQL time series database optimized specifically for IoT and Time Series data.
- InfluxDB - InfluxDB is an open-source time series database developed by InfluxData as part of their time series platform. It is written in Go.
- Redshift - Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service.
- Vertica - HPE Vertica 8 introduces a unified architecture and advanced in-database analytics capabilities that enable users to conduct sophisticated analysis at industry-leading scale and speed.
- Pivotal Greenplum - Pivotal Greenplum is a commercial fully featured data warehouse powered by the open source Greenplum Database.
- InfiniDB - Column Database Accelerates Insights for Analytics, BI, and Data Warehouse.
- Druid - Druid supports fast aggregations and sub-second OLAP queries.
- Amazon Simple Storage Service (S3) - Amazon Simple Storage Service (Amazon S3), provides developers and IT teams with secure, durable, highly-scalable cloud storage.
- Riak S2 - Riak® S2 is a highly available, scalable, easy-to-operate object storage software solution that’s optimized for holding videos, images, and other files.
- DSEFS - DSEFS (DataStax Enterprise file system) is a new distributed file system within DataStax Enterprise.
- Tableau - Tableau is data visualization software.
- Pentaho - Pentaho's big data integration and analytics solutions turn information into insights to help your organization gain a competitive advantage.
- Qlik - Qlik delivers Business Intelligence software for data visualization, guided analytics, embedded analytics and reporting to over 40000 customers worldwide.
- BIRT - BIRT is an open source software project that provides the BIRT technology platform to create data visualizations and reports.
- JasperReports® - The JasperReports Library is the world's most popular open source reporting engine.
- SAS Visual Analytics - Robust reporting and mobile BI.
- IBM Watson Analytics - Watson Analytics guides analysis with automated data visualization and discovery so you can uncover insights on your own.
- Actor Model - The actor model in computer science is a mathematical model of concurrent computation that treats "actors" as the universal primitives of concurrent computation.
- Communicating sequential processes - Communicating sequential processes (CSP) is a formal language for describing patterns of interaction in concurrent systems. It is a member of the family of mathematical theories of concurrency known as process algebras, or process calculi, based on message passing via channels.
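
To make the two models above a little more concrete, here is a minimal sketch using only the Python standard library. `CounterActor` is a hypothetical name: it owns private state, receives messages through a mailbox, and processes them one at a time on its own thread, which is the core idea of the actor model. The reply is delivered over a separate queue that plays the role of a channel, loosely in the CSP style. It is an illustration under those assumptions, not how Akka, Erlang, or Go implement these ideas.

```python
import queue
import threading


class CounterActor:
    """A tiny actor: private state, a mailbox, and one thread that handles messages in order."""

    _STOP = object()  # sentinel message that shuts the actor down

    def __init__(self):
        self._mailbox = queue.Queue()
        self._count = 0  # only ever touched by the actor's own thread
        self._thread = threading.Thread(target=self._run)
        self._thread.start()

    def send(self, message, reply_to=None):
        """Asynchronously deliver a message; the caller never touches actor state."""
        self._mailbox.put((message, reply_to))

    def stop(self):
        self._mailbox.put((self._STOP, None))
        self._thread.join()

    def _run(self):
        while True:
            message, reply_to = self._mailbox.get()
            if message is self._STOP:
                break
            if message == "increment":
                self._count += 1
            elif message == "get" and reply_to is not None:
                reply_to.put(self._count)


def send_increments(actor, times):
    for _ in range(times):
        actor.send("increment")


actor = CounterActor()
senders = [threading.Thread(target=send_increments, args=(actor, 1000)) for _ in range(4)]
for sender in senders:
    sender.start()
for sender in senders:
    sender.join()

reply_channel = queue.Queue()
actor.send("get", reply_to=reply_channel)
total = reply_channel.get(timeout=5)
actor.stop()
print(total)  # 4000
```

No lock is needed around the counter because only the actor's thread ever reads or writes it; concurrent senders interact with it purely by message passing.
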
- Akka - Akka is a toolkit and runtime for building highly concurrent, distributed, and resilient message-driven applications on the JVM.
- Reactor - Reactor is a fully non-blocking foundation with efficient demand management.
- ReactiveX - ReactiveX is a combination of the best ideas from the Observer pattern, the Iterator pattern, and functional programming.
- Erlang - Erlang is a programming language used to build massively scalable soft real-time systems with requirements on high availability.
- Goroutines - A goroutine is a function that is capable of running concurrently with other functions.
- Vert.x - Vert.x is a tool-kit for building reactive applications on the JVM.
- Play Framework - Play Framework makes it easy to build web applications with Java & Scala. Play is based on a lightweight, stateless, web-friendly architecture. Built on Akka.
- Spring Web Reactive - Reactive programming is about non-blocking applications that are asynchronous and event-driven and require a small number of threads to scale.
- Lagom - Lagom is a framework for creating reactive microservice-based systems.
- Play Framework - Play Framework makes it easy to build web applications with Java & Scala. Play is based on a lightweight, stateless, web-friendly architecture. Built on Akka.
- Spring Boot - Spring Boot makes it easy to create stand-alone, production-grade Spring based Applications that you can "just run".
- Dropwizard - Dropwizard is a Java framework for developing ops-friendly, high-performance, RESTful web services.
- Sparkjava - Spark Framework - Create web applications in Java rapidly. Spark is a micro web framework that lets you focus on writing your code, not boilerplate code.
- Ganglia - Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and Grids.
- Nagios - Nagios provides enterprise-class Open Source IT monitoring, network monitoring, server and applications monitoring.
- Datadog - See metrics from all of your apps, tools & services in one place with Datadog's cloud monitoring as a service solution.
- New Relic - A software analytics tool suite used by developers, ops, and software companies to understand how your applications are performing in development.
- Perfino - Perfino is a zero-overhead APM solution for monitoring Java application servers.
- Sensu - Monitor servers, services, application health, and business KPIs.
- OverOps - Know why Java code fails in production.
- HAProxy - The Reliable, High Performance TCP/HTTP Load Balancer.
- Tyk - Tyk is an open source API Gateway that is fast, scalable and modern.
- WSO2 - WSO2 provides the open source enterprise platform that helps to build, integrate, analyse and manage your APIs, applications, and Web services.
- JUnit - JUnit is a simple framework to write repeatable tests.
- Mockito - A mocking framework for unit tests written in Java.
- TestNG - TestNG is a testing framework inspired by JUnit and NUnit.
- DbUnit - A JUnit extension that puts a database into a known state between test runs.
- Selenium - Selenium is a suite of tools to automate web browsers across many platforms.
- Cucumber - Cucumber is a tool for running automated acceptance tests written in a behavior-driven development (BDD) style.
- SoapUI - SoapUI is the world-leading open source functional testing tool for API testing.
- LoadUI - LoadUI, a Performance Load Testing tool for APIs & Web Services.
- Secure Pro - Simulate attacks against your REST and SOAP services so you know they're safe. Build a Trusted API with Secure Pro, Based on The world's Most Trusted API.
- SonarQube - SonarQube is an open platform to manage code quality.
- Gerrit - Gerrit provides web based code review and repository management for the Git version control system.
- PMD - PMD is a source code analyzer.
- Checkmarx - Checkmarx is a provider of state-of-the-art application security solutions: static code analysis software, seamlessly integrated into the development process.
- Flyway - Flyway lets you regain control of your database migrations with pleasure and plain SQL.
- Liquibase - Liquibase is an open source database-independent library for tracking, managing and applying database schema changes.
- RedGate - Redgate's DLM (Database Lifecycle Management) solution helps you put in place a trusted, scalable and repeatable database change management process.
- Apache James - The Apache Java Mail Server is a 100% pure Java SMTP, IMAP4 and POP3 Mail server designed to be a complete and portable enterprise mail.
- JProfiler - JProfiler is the leading Java Profiler for profiling on the JVM.
- Docker - Docker is an open platform for developers and sysadmins to build, ship, and run distributed applications, whether on laptops, data center VMs, or the cloud.
- Ansible - Ansible is the simplest way to automate apps and IT infrastructure.
- Jenkins - Jenkins is an open source automation server written in Java.
- Bamboo - Continuous delivery, from code to deployment. Focus on coding and count on Bamboo as your CI and build server.
- Chef - Chef is an open source software agent that automates your infrastructure by turning it into code.
- Puppet - Puppet is an open-source configuration management tool.
- OWASP - The Open Web Application Security Project (OWASP) is an online community which creates freely-available articles, methodologies, documentation, tools.
- OAuth - OAuth is an open standard for authorization, commonly used as a way for Internet users to authorize websites or applications to access their information on other websites.
- SAML - Security Assertion Markup Language (SAML, pronounced sam-el) is an XML-based, open-standard data format for exchanging authentication and authorization data.
- MIT KDC - Kerberos is a network authentication protocol. It is designed to provide strong authentication for client/server applications by using secret-key cryptography.
- OpenLDAP - OpenLDAP Software is an open source implementation of the Lightweight Directory Access Protocol.
- Active Directory - Active Directory (AD) is a directory service that Microsoft developed for Windows domain networks.
- Xamarin - Xamarin's mobile application development platform with native user interfaces enables sharing of code across all platforms with a single C# codebase.
- React Native - Build Native Mobile Apps using JavaScript and React. React Native lets you build mobile apps using only JavaScript. It uses the same design as React.
- Apache Cordova - Apache Cordova (formerly PhoneGap) is a popular mobile application development framework originally created by Nitobi.
- Unity3D - Unity3D is a cross-platform game engine developed by Unity Technologies and used to develop video games for PC, consoles, mobile devices and websites.
- Unreal Engine - Unreal Engine 4 is a suite of integrated tools for game developers to design and build games, simulations, and visualizations.
- IBM Watson - IBM Watson is a technology platform that uses natural language processing and machine learning to reveal insights from large amounts of unstructured data.
- OpenCV - OpenCV (Open Source Computer Vision) is a library of programming functions mainly aimed at real-time computer vision.
- Brillo - Brillo brings the simplicity and speed of software development to hardware for IoT with an embedded OS, core services, developer kit, and developer console.
- mbed OS - ARM mbed OS is a platform operating system designed for the internet of things.
- Draw.io - Draw.io is free online diagram software for making flowcharts, process diagrams, org charts, UML, ER and network diagrams.
- Microsoft Visio - Microsoft Visio (formerly Microsoft Office Visio) is a diagramming and vector graphics application and is part of the Microsoft Office family.
- Enterprise Architect - Sparx Systems Enterprise Architect is a visual modeling and design tool based on UML.
- Web Summit - It's been called 'the best technology conference on the planet'.
- AWS Summit - Whether you are new to the cloud or an experienced user, you will learn something new at an AWS Summit.
- Gamescom - Gamescom (stylized as gamescom) is a trade fair for video games held annually at the Koelnmesse in Cologne, North Rhine-Westphalia, Germany.
- GDC - An event designed to inform and educate game industry professionals on online multiplayer games, mobile and next generation game technologies.
- Strata Hadoop World - Where big data, cutting-edge data science, and new business fundamentals intersect and merge.
- Spark Summit - Organized by Databricks, Spark Summit is the premier big data conference series dedicated to bringing the Apache Spark community together across the globe.
- Reactive Summit - Organized by Lightbend.
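- Backtracking - Backtracking is a general algorithm for finding all (or some) solutions to some computational problems. It builds candidate solutions incrementally and abandons a candidate as soon as it determines that the candidate cannot be completed to a valid solution.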
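
As an illustration of backtracking, the sketch below solves the N-Queens puzzle, a common textbook example chosen here only for demonstration. The function name `solve_n_queens` and its helpers are hypothetical. Queens are placed row by row; whenever a square is attacked the branch is skipped, and after exploring a placement the queen is removed again so that the next column can be tried.

```python
def solve_n_queens(n):
    """Return every way to place n non-attacking queens on an n x n board.

    Each solution is a tuple where index = row and value = column.
    """
    solutions = []
    cols = []  # cols[r] is the column of the queen already placed in row r

    def is_safe(row, col):
        # A new queen conflicts with an earlier one if they share a column
        # or a diagonal (equal horizontal and vertical distance).
        for r, c in enumerate(cols):
            if c == col or abs(c - col) == row - r:
                return False
        return True

    def place(row):
        if row == n:
            solutions.append(tuple(cols))  # every row filled: record a solution
            return
        for col in range(n):
            if is_safe(row, col):
                cols.append(col)   # choose
                place(row + 1)     # explore
                cols.pop()         # un-choose (backtrack)

    place(0)
    return solutions


print(solve_n_queens(4))        # [(1, 3, 0, 2), (2, 0, 3, 1)]
print(len(solve_n_queens(8)))   # 92
```

The pruning in `is_safe` is what separates backtracking from plain brute force: whole subtrees of placements are never generated once a partial board is known to be invalid.
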