Scalable architecture for Big Data financial analytics: user-defined functions vs. SQL

被引:6
|
作者
Stockinger, Kurt [1 ]
Bundi, Nils [2 ,3 ]
Heitz, Jonas [1 ]
Breymann, Wolfgang [1 ]
机构
[1] Zurich Univ Appl Sci, Technikumstr 9, CH-8400 Winterthur, Switzerland
[2] Ariadne Business Analyt AG, Alpenstr 16, CH-6300 Zug, Switzerland
[3] Stevens Inst Technol, 1 Castle Point Terrace, Hoboken, NJ 07030 USA
关键词
Financial analytics; Query processing; User-defined functions; Performance evaluation; SPARK; TRENDS;
D O I
10.1186/s40537-019-0209-0
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Large financial organizations have hundreds of millions of financial contracts on their balance sheets. Moreover, highly volatile financial markets and heterogeneous data sets within and across banks world-wide make near real-time financial analytics very challenging and their handling thus requires cutting edge financial algorithms. However, due to a lack of data modeling standards, current financial risk algorithms are typically inconsistent and non-scalable. In this paper, we present a novel implementation of a real-world use case for performing large-scale financial analytics leveraging Big Data technology. We first provide detailed background information on the financial underpinnings of our framework along with the major financial calculations. Afterwards we analyze the performance of different parallel implementations in Apache Spark based on existing computation kernels that apply the ACTUS data and algorithmic standard for financial contract modeling. The major contribution is a detailed discussion of the design trade-offs between applying user-defined functions on existing computation kernels vs. partially re-writing the kernel in SQL and thus taking advantage of the underlying SQL query optimizer. Our performance evaluation demonstrates almost linear scalability for the best design choice.
引用
收藏
页数:24
相关论文
共 22 条
  • [1] Scalable architecture for Big Data financial analytics: user-defined functions vs. SQL
    Kurt Stockinger
    Nils Bundi
    Jonas Heitz
    Wolfgang Breymann
    Journal of Big Data, 6
  • [2] User-Defined Financial Functions for MS SQL Server
    Gubalova, Jolana
    Medvedova, Petra
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (09) : 19 - 25
  • [3] White-Box Testing of Big Data Analytics with Complex User-Defined Functions
    Gulzar, Muhammad Ali
    Mardani, Shaghayegh
    Musuvathi, Madanlal
    Kim, Miryung
    ESEC/FSE'2019: PROCEEDINGS OF THE 2019 27TH ACM JOINT MEETING ON EUROPEAN SOFTWARE ENGINEERING CONFERENCE AND SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING, 2019, : 290 - 301
  • [4] Efficient Execution of User-Defined Functions in SQL Queries
    Foufoulas, Yannis
    Simitsis, Alkis
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2023, 16 (12): : 3874 - 3877
  • [5] Theoretically-Defined vs. User-Defined Squeeze Gestures
    Villarreal-Narvaez S.
    Sluyters A.
    Vanderdonckt J.
    Luzayisu E.M.
    Proceedings of the ACM on Human-Computer Interaction, 2022, 6 (ISS): : 73 - 102
  • [6] Supporting User-Defined Functions on Uncertain Data
    Tran, Thanh T. L.
    Diao, Yanlei
    Sutton, Charles
    Liu, Anna
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (06): : 469 - 480
  • [7] SQL-SA for Big Data Discovery Polymorphic and Parallelizable SQL User-Defined Scalar and Aggregate Infrastructure in Teradata Aster 6.20
    Tang, Xin
    Wehrmeister, Robert
    Shau, James
    Chakraborty, Abhirup
    Alex, Daley
    Al Omari, Awny
    Atnafu, Feven
    Davis, Jeff
    Deng, Litao
    Jaiswal, Deepak
    Keswani, Chittaranjan
    Lu, Yafeng
    Ren, Chao
    Reyes, Tom
    Siddiqui, Kashif
    Simmen, David
    Vidhani, Devendra
    Wang, Ling
    Yang, Shuai
    Yu, Daniel
    2016 32ND IEEE INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2016, : 1182 - 1193
  • [8] SQL/MapReduce: A practical approach to self-describing, polymorphic, and parallelizable user-defined functions
    Friedman, Eric
    Pawlowski, Peter
    Cieslewicz, John
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (02): : 1402 - 1413
  • [9] YeSQL: "You extend SQL" with Rich and Highly Performant User-Defined Functions in Relational Databases
    Foufoulas, Yannis
    Simitsis, Alkis
    Stamatogiannakis, Lefteris
    Ioannidis, Yannis
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (10): : 2270 - 2283
  • [10] Big Data vs. Data Mining for Social Media Analytics
    Danubianu, M.
    Barila, A.
    SMART 2014 - SOCIAL MEDIA IN ACADEMIA: RESEARCH AND TEACHING, 2015, : 261 - 269