Floating Point

Creator
Seonglae Cho
Created
2021 Apr 22 2:31
Edited
2025 Jan 14 9:45
Refs
Posits

Exponent + Mantissa

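The radix point moves according to the value of the exponent.
FP8 has two formats, E4M3 (4 exponent bits, 3 mantissa bits) and E5M2 (5 exponent bits, 2 mantissa bits), each with one sign bit.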
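A minimal sketch of how a sign/exponent/mantissa byte turns into a value, and how the exponent field shifts the radix point while the mantissa bits stay the same. The helper name `decode_minifloat` is hypothetical, and the decoder assumes an IEEE-754-style layout with bias 2^(exp_bits-1) - 1. It deliberately ignores special encodings (infinity, NaN), which differ between FP8 variants, so every bit pattern is treated as a finite number.

```python
def decode_minifloat(byte: int, exp_bits: int, man_bits: int) -> float:
    """Decode an 8-bit pattern laid out as [sign | exponent | mantissa].

    Assumption: IEEE-754-style bias of 2**(exp_bits - 1) - 1.
    Assumption: inf/NaN encodings are NOT handled; all patterns are finite.
    """
    if 1 + exp_bits + man_bits != 8:
        raise ValueError("sign + exponent + mantissa must total 8 bits")

    sign = -1.0 if (byte >> 7) & 1 else 1.0
    exponent = (byte >> man_bits) & ((1 << exp_bits) - 1)
    mantissa = byte & ((1 << man_bits) - 1)
    bias = (1 << (exp_bits - 1)) - 1
    fraction = mantissa / (1 << man_bits)

    if exponent == 0:
        # Subnormal: no implicit leading 1, exponent fixed at 1 - bias.
        return sign * fraction * 2.0 ** (1 - bias)
    # Normal: implicit leading 1 in front of the stored mantissa bits.
    return sign * (1.0 + fraction) * 2.0 ** (exponent - bias)


# Same mantissa bits (0b100 -> 1.5), different exponent fields, E4M3 layout:
# only the position of the radix point changes.
for e in (6, 7, 8, 9):
    pattern = (e << 3) | 0b100
    print(f"{pattern:08b} -> {decode_minifloat(pattern, 4, 3)}")

# The same byte means different things in the two FP8 layouts.
print(decode_minifloat(0x7E, 4, 3))  # E4M3 reading
print(decode_minifloat(0x7E, 5, 2))  # E5M2 reading
```

E4M3 spends more bits on the mantissa (finer steps, narrower range), while E5M2 spends more on the exponent (coarser steps, wider range).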
Floating Point Notion
 
 
 
 

new way to

Floating point computations today and in the future - Boguslaw Cyganek - Meeting C++ 2022
Slides: https://slides.meetingcpp.com Survey: https://survey.meetingcpp.com When we see large numbers or small fractions we automatically pick float or double floating-point types for our data representation. However, is it always optimal? What other choices do we have? Are there any other standards than the IEEE 754 FP? In this talk I’ll present such topics as: Brief intro to the FP IEEE 754 standard, its pros, cons and pitfalls when computing in the FP domain. How can we compress floating point data – shorter formats, lossless & lossy compressions, new ZFP and SZ methods and their use in C++. A new kid on the block – the POSIT arithmetic and how to use it in C++ programs. Intro to the relevant C++ libraries and – as always – a lot of practical examples. A few notes on deep learning and FP formats.

Ways

Floating-point arithmetic
In computing, floating-point arithmetic (FP) is arithmetic that represents real numbers approximately, using an integer with a fixed precision, called the significand, scaled by an integer exponent of a fixed base. For example, 12.345 can be represented as a base-ten floating-point number with significand 12345 and exponent −3: 12.345 = 12345 × 10^−3.
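The significand-times-base-to-the-exponent idea can be checked directly. This is a small sketch using only the Python standard library; the helper name `decimal_parts` is hypothetical. It splits a decimal literal into an integer significand and a base-ten exponent, then shows the base-two analogue that a binary `float` uses via `math.frexp`.

```python
from decimal import Decimal
import math


def decimal_parts(text: str) -> tuple:
    """Return (significand, exponent) with value == significand * 10**exponent."""
    sign, digits, exponent = Decimal(text).as_tuple()
    significand = int("".join(str(d) for d in digits))
    return (-significand if sign else significand, exponent)


# Base ten: an integer significand scaled by a power of ten.
print(decimal_parts("12.345"))  # (12345, -3)

# Base two: frexp returns m and e with x == m * 2**e and 0.5 <= |m| < 1.
x = 12.345
m, e = math.frexp(x)
print(m, e)
print(m * 2 ** e == x)  # scaling by a power of two is exact
```

Note that the binary `float` holds only the nearest representable value to 12.345, whereas the `Decimal` form stores the decimal digits exactly.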
 
