real time log processing using apache spark streaming

后端 未结 3 548
情书的邮戳
情书的邮戳 2021-02-06 02:05

I want to create a system where I can read logs in real time, and use apache spark to process it. I am confused if I should use something like kafka or flume to pass the logs to

3条回答
  •  灰色年华
    2021-02-06 03:07

    Although this is a old question, posting a link from Databricks, which has a great step by step article for log analysis with Spark considering many areas.

    https://databricks.gitbooks.io/databricks-spark-reference-applications/content/logs_analyzer/index.html

    Hope this helps.

提交回复
热议问题