A Productivity Improvement of Distributed Software Testing using Checkpoint

被引:0
|
作者
Sudsee, Bhuridech [1 ]
Kaewkasi, Chanwit [1 ]
机构
[1] Suranaree Univ Technol, Sch Comp Engn, Nakhon Ratchasrima 30000, Thailand
关键词
Distributed Checkpointing; Apache Spark; Big Data Testing; Software Testing;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The advancement of storage technologies and the fast-growing number of generated data have made the world moved into the Big Data era. In this past, we had many data mining tools but they are inadequate to process Data-Intensive Scalable Computing workloads. The Apache Spark framework is a popular tool designed for Big Data processing. It leverages in-memory processing techniques that make Spark up to 100 times faster than Hadoop. Testing this kind of Big Data program is time consuming. Unfortunately, developers lack a proper testing framework, which cloud help assure quality of their data-intensive processing programs, while saving development time. We propose Distributed Test Checkpointing (DTC) for Apache Spark. DTC applies unit testing to the Big Data software development life cycle and reduce time spent for each testing loop with checkpoint. From the experimental results, we found that in the subsequence rounds of unit testing, DTC dramatically speed the testing time up to 450-500% faster. In case of storage, DTC can cut unnecessary data off and make the storage 19.7 times saver than the original checkpoint of Spark.
引用
收藏
页码:78 / 84
页数:7
相关论文
共 50 条
  • [1] Improvement of software quality and productivity using development tools
    Abotani, Hideo
    Shiratori, Tomoki
    Sasaki, Kouji
    Tonomura, Masaki
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2006, 42 (03): : 323 - 332
  • [2] Checkpoint processing in distributed systems software using synchronized clocks
    Neogy, S
    Sinha, A
    Das, PK
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, PROCEEDINGS, 2001, : 555 - 559
  • [3] An Improvement of a Checkpoint-based Distributed Testing Technique on a Big Data Environment
    Sudsee, Bhuridech
    Kaewkasi, Chanwit
    2019 21ST INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION TECHNOLOGY (ICACT): ICT FOR 4TH INDUSTRIAL REVOLUTION, 2019, : 1081 - 1090
  • [4] SOFTWARE PRODUCTIVITY NEEDS TOOLS FOR IMPROVEMENT
    SCHINDLER, M
    ELECTRONIC DESIGN, 1980, 28 (17) : 45 - &
  • [5] THE DEVELOPMENT OF CUSTOMIZED SOFTWARE FOR PRODUCTIVITY IMPROVEMENT
    GUSTAFSON, S
    COMPUTERS & INDUSTRIAL ENGINEERING, 1989, 17 : 159 - 163
  • [6] Parallel testing of distributed software
    Lastovetsky, A
    INFORMATION AND SOFTWARE TECHNOLOGY, 2005, 47 (10) : 657 - 662
  • [7] Testing and debugging of distributed software
    Cunha, JC
    Krawczyk, H
    COMPUTERS AND ARTIFICIAL INTELLIGENCE, 2000, 19 (06): : 495 - 510
  • [8] SOFTWARE QUALITY AND PRODUCTIVITY IMPROVEMENT ACTIVITIES AT NEC
    Honda, Naomi
    Yamada, Shigeru
    ICIM2012: PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON INDUSTRIAL MANAGEMENT, 2012, : 3 - 8
  • [9] Analysis on productivity improvement in Chinese software enterprises
    Gan, WH
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND ENGINEERING MANAGEMENT, VOLS 1 AND 2: MODERN INDUSTRIAL ENGINEERING AND INNOVATION IN ENTERPRISE MANAGEMENT, 2005, : 1225 - 1230
  • [10] Isolated Testing of Software Components in Distributed Software Systems
    Thillen, Francois
    Mordinyi, Richard
    Biffl, Stefan
    SOFTWARE QUALITY: MODEL-BASED APPROACHES FOR ADVANCED SOFTWARE AND SYSTEMS ENGINEERING, 2014, 166 : 170 - 184