Information jet: Handling noisy big data from weakly disconnected network

POSTER

Abstract

Sudden aggregation (information jet) of large amount of data is ubiquitous around connected social networks, driven by sudden interacting and non-interacting events, network security threat attacks, online sales channel etc. Clustering of information jet based on time series analysis and graph theory is not new but little work is done to connect them with particle jet statistics. We show pre-clustering based on context can element soft network or network of information which is critical to minimize time to calculate results from noisy big data. We show difference between, stochastic gradient boosting and time series-graph clustering. For disconnected higher dimensional information jet, we use Kallenberg representation theorem (Kallenberg, 2005, arXiv: 1401.1137) to identify and eliminate jet similarities from dense or sparse graph.

Authors

  • Deeder Aurongzeb

    Texas A&M university, Department of Statistics.