Generic, efficient, and effective deobfuscation and semantic-aware attack detection for PowerShell scripts

被引:4
|
作者
Xiong, Chunlin [1 ]
Li, Zhenyuan [1 ]
Chen, Yan [2 ]
Zhu, Tiantian [3 ]
Wang, Jian [1 ]
Yang, Hai [4 ]
Ruan, Wei [5 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Northwestern Univ, Dept Elect Engn & Comp Sci, Evanston, IL 60208 USA
[3] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
[4] Mag Shield Co Ltd, Hangzhou 310027, Peoples R China
[5] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
PowerShell; Abstract syntax tree; Obfuscation and deobfuscation; Malicious script detection; TP309;
D O I
10.1631/FITEE.2000436
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, PowerShell has increasingly been reported as appearing in a variety of cyber attacks. However, because the PowerShell language is dynamic by design and can construct script fragments at different levels, state-of-the-art static analysis based PowerShell attack detection approaches are inherently vulnerable to obfuscations. In this paper, we design the first generic, effective, and lightweight deobfuscation approach for PowerShell scripts. To precisely identify the obfuscated script fragments, we define obfuscation based on the differences in the impacts on the abstract syntax trees of PowerShell scripts and propose a novel emulation-based recovery technology. Furthermore, we design the first semantic-aware PowerShell attack detection system that leverages the classic objective-oriented association mining algorithm and newly identifies 31 semantic signatures. The experimental results on 2342 benign samples and 4141 malicious samples show that our deobfuscation method takes less than 0.5 s on average and increases the similarity between the obfuscated and original scripts from 0.5% to 93.2%. By deploying our deobfuscation method, the attack detection rates for Windows Defender and VirusTotal increase substantially from 0.33% and 2.65% to 78.9% and 94.0%, respectively. Moreover, our detection system outperforms both existing tools with a 96.7% true positive rate and a 0% false positive rate on average.
引用
收藏
页码:361 / 381
页数:21
相关论文
共 50 条
  • [41] Energy efficient D-TLB and data cache using semantic-aware multilateral partitioning
    Lee, HHS
    Ballapuram, CS
    ISLPED'03: PROCEEDINGS OF THE 2003 INTERNATIONAL SYMPOSIUM ON LOW POWER ELECTRONICS AND DESIGN, 2003, : 306 - 311
  • [42] Semantic-aware self-supervised depth estimation for stereo 3D detection
    Sun, Hanqing
    Cao, Jiale
    Pang, Yanwei
    PATTERN RECOGNITION LETTERS, 2023, 167 : 164 - 170
  • [43] Watermarking Conditional Text Generation for AI Detection: Unveiling Challenges and a Semantic-Aware Watermark Remedy
    Fu, Yu
    Xiong, Deyi
    Dong, Yue
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 16, 2024, : 18003 - 18011
  • [44] FLAG: Few-Shot Latent Dirichlet Generative Learning for Semantic-Aware Traffic Detection
    Ye, Tianpeng
    Li, Gaolei
    Ahmad, Ijaz
    Zhang, Chaofeng
    Lin, Xiang
    Li, Jianhua
    IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2022, 19 (01): : 73 - 88
  • [45] A deep semantic-aware approach for Cantonese rumor detection in social networks with graph convolutional network
    Chen, Xinyu
    Jian, Yifei
    Ke, Liang
    Qiu, Yunxiang
    Chen, Xingshu
    Song, Yunya
    Wang, Haizhou
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 245
  • [46] Multi-Resolution and Semantic-Aware Bidirectional Adapter for Multi-Scale Object Detection
    Li, Zekun
    Pan, Jin
    He, Peidong
    Zhang, Ziqi
    Zhao, Chunlu
    Li, Bing
    APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [47] ROIFormer: Semantic-Aware Region of Interest Transformer for Efficient Self-Supervised Monocular Depth Estimation
    Xing, Daitao
    Shen, Jinglin
    Ho, Chiuman
    Tzes, Anthony
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 3, 2023, : 2983 - 2991
  • [48] DSRPH: Deep semantic-aware ranking preserving hashing for efficient multi-label image retrieval
    Shen, Yiming
    Feng, Yong
    Fang, Bin
    Zhou, Mingliang
    Kwong, Sam
    Qiang, Bao-hua
    INFORMATION SCIENCES, 2020, 539 : 145 - 156
  • [49] DeepACG: Co-Saliency Detection via Semantic-aware Contrast Gromov-Wasserstein Distance
    Zhang, Kaihua
    Dong, Mingliang
    Liu, Bo
    Yuan, Xiao-Tong
    Liu, Qingshan
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 13698 - 13707
  • [50] End-to-End Object-Level Contrastive Pretraining for Detection via Semantic-Aware Localization
    Geng, Long
    Huang, Xiaoming
    ARTIFICIAL INTELLIGENCE, CICAI 2023, PT I, 2024, 14473 : 293 - 304