Wednesday, December 2, 2020

Big Data architecture technology family tree

Big Data Analytics Catalog 

  • Integration
    • Messaging
      • Data Collector
        • Apache Flume
        • Logstash
        • Fluentd
      • Distributed Message Broker
        • Apache Kafka
        • RabbitMQ
        • Amazon SQS
        • Apache ActiveMQ
    • ETL/ELT
      • ETL/Data Integration Engine
        • StreamSets
        • Talend
        • Informatica
  • Data Storage
    • Distributed File System
      • HDFS
      • CassandraFS
    • NoSQL Database
      • Key-Value
        • Riak
        • Redis
        • Berkeley DB
      • Document-Oriented
        • MongoDB
        • CouchDB
      • Column-Family
        • HBase
        • Cassandra
      • Graph-Oriented
        • Neo4J
        • OrientDB
    • Analytic RDBMS
      • MPP Analytic RDBMS
        • HP Vertica
        • Teradata
        • Microsoft SQL Server Parallel Data Warehouse (MS PDW)
        • Amazon Redshift
      • Traditional Analytic RDBMS
        • MS SQL Server
        • Oracle RDBMS
        • IBM DB2
  • Processing & Analytics
    • Visualization & Reporting
      • BI Platform
        • QlikView
        • Microstrategy
        • Tableau
        • Tibco JasperSoft
        • Pentaho
      • Interactive Dashboard
        • Splunk
        • Kibana
        • Zoomdata
      • Graphic Library
        • D3.js
        • GoJS
        • Highcharts
    • Search & Query
      • Interactive Query Engine
        • Impala
        • Apache Hive (Stinger)
        • Spark SQL
      • Distributed Search Engine
        • Splunk
        • Elasticsearch
        • Apache Solr
    • Processing
      • Distributed Computing Engine
        • Hadoop MapReduce
        • Apache Spark
        • Apache Tez
      • Event Stream Processor
        • Apache Storm
        • Spark Streaming
        • Apache Samza
        • Amazon Kinesis
      • Data Processing Framework
        • Cascading
        • Apache Crunch
        • Apache Hive
        • Amazon Pig

Reference:

Designing Software Architectures: A Practical Approach by Humberto Cervantes & Rick Kazman

No comments:

Post a Comment

Why is Go fast?

Why is Go fast? Go has become popular for microprocesses & for scaling. What are the design decisions that make Go fast? Summary: 1. Cle...