Microbatch ingestion using Spark Streaming and Cassandra

In this post you’ll learn how to ingest data files in near real time using Spark Structured Streaming framework and storing them in Cassandra database. Spark Structured Streaming allows the programmer to implement the streaming solutions by means of dataframes instead of RDDs. This makes the development much more agile while preserving the performance thanks…

Details

Online Platform for Clinical and Molecular data Retrieval

Online Platform for Clinical and Molecular data Retrieval Introduction This post is about describing a technologies stack used in a project aimed to create an online platform for clinical and molecular data retrieval. The online platform also includes a functionality to analyse molecular data stored in the cloud. The context of the project is in…

Details