A semantics-based approach to malware detection

被引:39
|
作者
Preda, Mila Dalla [1 ]
Christodorescu, Mihai [2 ]
Jha, Somesh [2 ]
Debray, Saumya [3 ]
机构
[1] Univ Verona, I-37100 Verona, Italy
[2] Univ Wisconsin, Madison, WI 53706 USA
[3] Univ Arizona, Tucson, AZ USA
基金
美国国家科学基金会;
关键词
security; languages; theory; verification; malware detection; obfuscation; trace semantics; abstract interpretation;
D O I
10.1145/1387673.1387674
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Malware detection is a crucial aspect of software security. Current malware detectors work by checking for signatures, which attempt to capture the syntactic characteristics of the machine-level byte sequence of the malware. This reliance on a syntactic approach makes current detectors vulnerable to code obfuscations, increasingly used by malware writers, that alter the syntactic properties of the malware byte sequence without significantly affecting their execution behavior. This paper takes the position that the key to malware identification lies in their semantics. It proposes a semantics-based framework for reasoning about malware detectors and proving properties such as soundness and completeness of these detectors. Our approach uses a trace semantics to characterize the behavior of malware as well as that of the program being checked for infection, and uses abstract interpretation to "hide" irrelevant aspects of these behaviors. As a concrete application of our approach, we show that (1) standard signature matching detection schemes are generally sound but not complete, (2) the semantics-aware malware detector proposed by Christodorescu et al. is complete with respect to a number of common obfuscations used by malware writers and (3) the malware detection scheme proposed by Kinder et al. and based on standard model-checking techniques is sound in general and complete on some, but not all, obfuscations handled by the semantics-aware malware detector.
引用
收藏
页数:54
相关论文
共 50 条
  • [31] Semantics-based video indexing using a stochastic modeling approach
    Wei, Yong
    Bhandarkar, Suchendra M.
    Li, Kang
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 2009 - 2012
  • [32] GeoCosm: A semantics-based approach for information integration of geospatial data
    Ram, S
    Khatri, V
    Zhang, LM
    Zeng, DD
    CONCEPTUAL MODELING FOR NEW INFORMATION SYSTEMS TECHNOLOGIES, 2002, 2465 : 152 - 165
  • [33] Personal name resolution crossover documents by a semantics-based approach
    Phan, XH
    Nguyen, LM
    Horiguchi, S
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (02): : 825 - 836
  • [34] Professional services automation: A semantics-based approach for knowledge management
    Kashyap, V
    Dalal, S
    Tukey, P
    Behrens, C
    KNOWLEDGE MANAGEMENT & INTELLIGENT ENTERPRISES, 2001, : 10 - 25
  • [35] A semantics-based approach to design of query languages for partial information
    Libkin, L
    SEMANTICS IN DATABASES, 1998, 1358 : 170 - 208
  • [36] A Semantics-Based Approach for Business Categorization on Social Networking Sites
    Memon, Atia Bano
    Zinke, Christian
    Meyer, Kyrill
    COLLABORATION IN A DATA-RICH WORLD, 2017, 506 : 678 - 687
  • [37] A Semantics-Based Hybrid Approach on Binary Code Similarity Comparison
    Hu, Yikun
    Wang, Hui
    Zhang, Yuanyuan
    Li, Bodong
    Gu, Dawu
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2021, 47 (06) : 1241 - 1258
  • [38] Semantics-Based Information Valuation
    Al-Saffar, Sinan
    Heileman, Gregory L.
    2008 4TH INTERNATIONAL IEEE CONFERENCE INTELLIGENT SYSTEMS, VOLS 1 AND 2, 2008, : 284 - 291
  • [39] Semantics-Based Code Search
    Reiss, Steven P.
    2009 31ST INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2009, : 243 - 253
  • [40] A semantics-based consultations workbench
    Vassilakis, C
    Gouscos, D
    Georgiadis, P
    Enabling Technologies for the New Knowledge Society, 2005, : 421 - 434