LDA in network traffic data(proxy, firewall dns)
https://community.hortonworks.com/articles/84781/spark-text-analytics-uncovering-data-driven-topics.html
https://databricks-prod-cloudfront.cloud.databricks.com/public/4027ec902e239c93eaaa8714f173bcfc/3741049972324885/3783546674231782/4413065072037724/latest.html https://stackoverflow.com/questions/42051184/latent-dirichlet-allocation-lda-in-spark https://dzone.com/articles/spark-lda-a-complete-example-of-clustering-algorit http://blog.echen.me/2011/08/22/introduction-to-latent-dirichlet-allocation/ https://github.com/shiv4nsh/spark-LDA-example/blob/master/src/main/scala/LDAExample.scala https://github.com/pcejrowski/spark-lda-example/blob/master/src/main/scala/pl/pcejrowski/LdaTopics.scala https://github.com/zaratsian/Spark/blob/7796387b64abd3ea2f449e16f241f98ccab1bb5f/text_analytics_datadriven_topics.py