Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Discovering Signals from Web Sources to Predict Cyber Attacks (1806.03342v1)

Published 8 Jun 2018 in cs.SI, cs.LG, and stat.ML

Abstract: Cyber attacks are growing in frequency and severity. Over the past year alone we have witnessed massive data breaches that stole personal information of millions of people and wide-scale ransomware attacks that paralyzed critical infrastructure of several countries. Combating the rising cyber threat calls for a multi-pronged strategy, which includes predicting when these attacks will occur. The intuition driving our approach is this: during the planning and preparation stages, hackers leave digital traces of their activities on both the surface web and dark web in the form of discussions on platforms like hacker forums, social media, blogs and the like. These data provide predictive signals that allow anticipating cyber attacks. In this paper, we describe machine learning techniques based on deep neural networks and autoregressive time series models that leverage external signals from publicly available Web sources to forecast cyber attacks. Performance of our framework across ground truth data over real-world forecasting tasks shows that our methods yield a significant lift or increase of F1 for the top signals on predicted cyber attacks. Our results suggest that, when deployed, our system will be able to provide an effective line of defense against various types of targeted cyber attacks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Palash Goyal (31 papers)
  2. KSM Tozammel Hossain (3 papers)
  3. Ashok Deb (8 papers)
  4. Nazgol Tavabi (8 papers)
  5. Nathan Bartley (5 papers)
  6. Andr'es Abeliuk (1 paper)
  7. Emilio Ferrara (197 papers)
  8. Kristina Lerman (197 papers)
Citations (22)

Summary

We haven't generated a summary for this paper yet.