Presentation: "Distributed Processing for Distributed Database"
Today distribution matters. Several parts of a project have to be distributed for reasons of capacity, scalability, or availability. Many dimensions of the computational landscape need to be, and can be, handled in a distributed manner. From hardware provisioning and databases to the most recent distributed processing platforms, developers have a range of tools for fully distributing applications. In this talk we will look at data management and processing over distributed systems.
We will dive into the details of data distribution with MongoDB and of distributed processing with Apache Flink and Apache Spark. We will cover:
- How to optimize MongoDB for distributed environments
- How to connect Apache Flink with MongoDB
- How to bridge existing data in MongoDB into Spark for stream processing