Faithfully Rounded Floating-point Computations

被引:8
|
作者
Lange, Marko [1 ]
Rump, Siegfried M. [1 ,2 ]
机构
[1] Waseda Univ, Fac Sci & Engn, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
[2] Hamburg Univ Technol, Inst Reliable Comp, Schwarzenberg Campus 3, D-21071 Hamburg, Germany
来源
基金
日本科学技术振兴机构;
关键词
Double-double; inaccurate cancellation; rigorous error bounds; ACCURATE;
D O I
10.1145/3290955
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We present a pair arithmetic for the four basic operations and square root. It can be regarded as a simplified, more-efficient double-double arithmetic. The central assumption on the underlying arithmetic is the first standard model for error analysis for operations on a discrete set of real numbers. Neither do we require a floating-point grid nor a rounding to nearest property. Based on that, we define a relative rounding error unit u and prove rigorous error bounds for the computed result of an arbitrary arithmetic expression depending on u, the size of the expression, and possibly a condition measure. In the second part of this note, we extend the error analysis by examining requirements to ensure faithfully rounded outputs and apply our results to IEEE 754 standard conform floating-point systems. For a class of mathematical expressions, using an IEEE 754 standard conform arithmetic with base beta, the result is proved to be faithfully rounded for up to 1/root beta u - 2 operations. Our findings cover a number of previously published algorithms to compute faithfully rounded results, among them Horner's scheme, products, sums, dot products, or Euclidean norm. Beyond that, several other problems can be analyzed, such as polynomial interpolation, orientation problems, Householder transformations, or the smallest singular value of Hilbert matrices of large size.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Rigorous Roundoff Error Analysis of Probabilistic Floating-Point Computations
    Constantinides, George
    Dahlqvist, Fredrik
    Rakamaric, Zvonimir
    Salvia, Rocco
    COMPUTER AIDED VERIFICATION, PT II, CAV 2021, 2021, 12760 : 626 - 650
  • [22] Static analysis-based validation of floating-point computations
    Putot, S
    Goubault, E
    Martel, M
    NUMERICAL SOFTWARE WITH RESULT VERIFICATION, 2004, 2991 : 306 - 313
  • [23] Floating-point versus symbolic computations in the QD-algorithm
    Cuyt, A
    JOURNAL OF SYMBOLIC COMPUTATION, 1997, 24 (06) : 695 - 703
  • [24] Asserting the precision of floating-point computations: A simple abstract interpreter
    Goubault, E
    Martel, M
    Putot, S
    PROGRAMMING LANGUAGES AND SYSTEMS, PROCEEDINGS, 2002, 2305 : 209 - 212
  • [25] INEXPENSIVE CORRECTLY ROUNDED FLOATING-POINT DIVISION AND SQUARE ROOT WITH INPUT SCALING
    Viitanen, Timo
    Jaaskelainen, Pekka
    Takala, Jarmo
    2013 IEEE WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS), 2013, : 159 - 164
  • [26] Accelerating correctly rounded floating-point division when the divisor is known in advance
    Brisebarre, N
    Muller, JM
    Raina, SK
    IEEE TRANSACTIONS ON COMPUTERS, 2004, 53 (08) : 1069 - 1072
  • [27] Parallel floating-point expansions for extended-precision GPU computations
    Collange, Sylvain
    Joldes, Mioara
    Muller, Jean-Michel
    Popescu, Valentina
    2016 IEEE 27TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS (ASAP), 2016, : 139 - 146
  • [28] A Unified Coq Framework for Verifying C Programs with Floating-Point Computations
    Ramananandro, Tahina
    Mountcastle, Paul
    Meister, Benoit
    Lethin, Richard
    PROCEEDINGS OF THE 5TH ACM SIGPLAN CONFERENCE ON CERTIFIED PROGRAMS AND PROOFS (CPP'16), 2016, : 15 - 26
  • [29] A priori worst-case error bounds for floating-point computations
    Kramer, W
    13TH IEEE SYMPOSIUM ON COMPUTER ARITHMETIC, PROCEEDINGS, 1997, : 64 - 71
  • [30] SINGLE-CHIP ACCELERATORS SPEED FLOATING-POINT AND BINARY COMPUTATIONS
    QUONG, D
    PERLMAN, R
    ELECTRONIC DESIGN, 1984, 32 (23) : 246 - &