Analyzing the Impact of Approximate Arithmetic on Deep Neural Network Predictions

Published in: ProQuest Dissertations and Theses (2025)
Main author: Garcia, Johnatan
Published:
ProQuest Dissertations & Theses
Subjects: Computer science; Artificial intelligence
Online access: Citation/Abstract
Full Text - PDF

MARC

LEADER 00000nab a2200000uu 4500
001 3241059194
003 UK-CbPIL
020 |a 9798290993232 
035 |a 3241059194 
045 2 |b d20250101  |b d20251231 
084 |a 66569  |2 nlm 
100 1 |a Garcia, Johnatan 
245 1 |a Analyzing the Impact of Approximate Arithmetic on Deep Neural Network Predictions 
260 |b ProQuest Dissertations & Theses  |c 2025 
513 |a Dissertation/Thesis 
520 3 |a Artificial intelligence has become part of our daily lives, helping us solve complicated problems. Some of these problems are large and complex, requiring large models, and as models grow in complexity, they require more computation and energy to train and test. The execution of these models relies on floating-point arithmetic, which imposes constraints due to its finite precision. Because of these limitations, many computations are not exact, and the computer is forced to round or approximate. Several number formats can mitigate this issue: single precision provides 24 binary digits of precision, double precision provides 53 bits, and small formats such as FP8 may offer only 3 or 4 bits. Choosing the right format can drastically reduce the resources needed and lets us increase or decrease the precision depending on the model's performance. As it propagates through the model, the rounding error compounds across the layers and may affect the model's final prediction. If we can analyze the rounding errors, we can increase or decrease the model's precision to better balance resources and predictions; if we observe almost no error, we can reduce the precision, saving time and memory. In this work, we contributed by developing software that uses the PyTorch C++ API to load models and analyze the impact of the rounding error produced. We tested our software not only with standard feed-forward models, but with deep learning models as well. We built this on our own tensor implementation, which allows custom floating-point operations to be performed. With this class, we can produce the relative error, the absolute error, and an upper and lower bound on where the final answer may lie.  
653 |a Computer science 
653 |a Artificial intelligence 
773 0 |t ProQuest Dissertations and Theses  |g (2025) 
786 0 |d ProQuest  |t ProQuest Dissertations & Theses Global 
856 4 1 |3 Citation/Abstract  |u https://www.proquest.com/docview/3241059194/abstract/embedded/L8HZQI7Z43R0LA5T?source=fedsrch 
856 4 0 |3 Full Text - PDF  |u https://www.proquest.com/docview/3241059194/fulltextPDF/embedded/L8HZQI7Z43R0LA5T?source=fedsrch
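
The abstract above reasons about absolute error, relative error, and error bounds across floating-point formats. A minimal sketch of those quantities for a single operation, assuming nothing about the dissertation's actual C++/PyTorch tensor class (the function name and the use of NumPy here are purely illustrative):

```python
import numpy as np

# Illustrative only, not the dissertation's implementation: evaluate the
# same operation, 1/3, in formats of decreasing precision and compare
# against a float64 reference, reporting the three quantities the
# abstract names: absolute error, relative error, and an error bound.
def rounding_error_report(dtype):
    reference = 1.0 / 3.0                    # Python float = IEEE double
    approx = float(dtype(1.0) / dtype(3.0))  # same operation, lower precision
    abs_err = abs(approx - reference)
    rel_err = abs_err / abs(reference)
    # The unit roundoff u = eps/2 bounds the relative error of one
    # correctly rounded operation, giving (to first order) the interval
    # [approx*(1-u), approx*(1+u)] in which the true value must lie.
    u = float(np.finfo(dtype).eps) / 2.0
    return {"abs": abs_err, "rel": rel_err, "bound": u}

for dtype in (np.float32, np.float16):
    print(dtype.__name__, rounding_error_report(dtype))
```

The 24-bit and 53-bit significands the abstract cites correspond to unit roundoffs of 2^-24 and 2^-53, so the observed relative error shrinks as precision grows, while the bound widens for narrow formats like float16.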