Simple algorithms of the fast inverse square root with the use of magic constant with reduced relative errors for numbers of type float are described in the paper.
- Multiplier-free divide, square root, and log algorithms / F. Auger, Z. Lou, B. Feuvrie, F. Li // IEEE Signal Process. Mag. 2011. Vol. 28. No. 4. P. 122–126.
- Allie M. A. Root of Less Evil / M. Allie, R. Lyons // IEEE Signal Process. Mag.: DSP Tips and Tricks. 2005. Vol. 22. P. 93–96.
- Parhami B. Computer Arithmetic: Algorithms and Hardware Designs / B. Parhami. 2nd ed. New York : Oxford Univ. Press, 2010.
- Lemaitre Florian. Cholesky Factorization on SIMD multi-core architectures / Florian Lemaitre, Benjamin Couturier, Lionel Lacassagne // Journal of Systems Architecture. Elsevier, 2017. Vol. 79. P. 1–15.
- A Fast FPGA Based Architecture for Computation of Square Root and Inverse Square Root / A. Hasnat,T. Bhattacharyya, A. Dey, S. Halder, D. Bhattacharjee // Devices for Integrated Circuit (DevIC): int. conf., 23–24 Mar., 2017. Kalyani, 2017. P. 383–387.
- Beebe N. H. F. The Mathematical-Function Computation Handbook: Programming Using the MathCW Portable Software Library / N. H. F. Beebe. Springer, 2017.
- Optimizations of Two Compute-bound Scientific Kernels on the SW26010 Many-core Processor / J. Lin, Z. G. Xu, A. Nukada, N. Maruyama, S. Matsuoka // 46th International Conference on Parallel Processing, 14–17 Aug. 2017. Bristol : IEEE, 2017. P. 432–441.
- Improving Deep Learning By Inverse Square Root Linear Units (ISRLUS) / Brad Carlile, Guy Delamarter, Paul Kinney, Akiko Marti, Brian Whitney. 2018.
- Andriy Hrynchyshyn. An efficient algorithm for fast inverse square root / Hrynchyshyn Andriy, Horyachyy Oleh, Tymoshenko Oleksandr // Przetwarzanie, transmisja i bezpieczeństwo informacji. Bielsko-Biała : Wydawnictwo Naukowe ATH w Bielsku-Białej, 2018. T. 2. P. 105–113.
- Hanninen T. Novel detector implementations for 3G LTE downlink and uplink / T. Hanninen, J. Janhunen, M. Juntti // Analog. Integr. Circ. Sig. Process. 2014. Vol. 78. No. 3. P. 645–655.
- Floating point unit demonstration on STM32 microcontrollers: Application note AN4044. STMicroelectronics N.V., 2016.
- ARM® NEON™ Intrinsics Reference: IHI 0073B. ARM Limited, 2016.
- Hsu C. J. An Efficient Hardware Implementation of HON4D Feature Extraction for Real-time Action Recognition / C. J. Hsu, J. L. Chen, L. G. Chen // IEEE International Symposium on Consumer Electronics (ISCE). 2015.
- A UWB Radar Signal Processing Platform for Real-Time Human Respiratory Feature Extraction Based on Four-Segment Linear Waveform Model / C. H. Hsieh, Y. F. Chiu, Y. H. Shen, T. S. Chu, Y. H. Huang // IEEE Trans. Biomed. Circ. Syst. 2016. Vol. 10. No. 1. P. 219–230.
- Ziqiang Li. OFDM Synchronization implementation based on Chisel platform for 5G research / Li Ziqiang, Chen Yun, Zeng Xiaoyang // IEEE 11th International Conference on ASIC (ASICON). Chengdu : IEEE, 2015. P. 1–4.
- Sangeetha D. Efficient Scale Invariant Human Detection using Histogram of Oriented Gradients for IoT Services / D. Sangeetha, P. Deepa // IEEE 30th International Conference on VLSI Design and 16th International Conference on EmbeddedSystems. Hyderabad : IEEE, 2017. P. 61–66.
- Fog A. Software optimization resources, Instruction tables: Lists of instruction latencies, throughputs and micro-operation breakdowns for Intel, AMD and VIA CPUs [Electronic resource] / A. Fog. Regime of access: http://www.agner.org/optimize/.
- x86 and amd64 instruction reference [Electronic resource]. Regime of access: http://www.felixcloutier.com/x86/index.html.
- Lomont C. Fast inverse square root [Electronic resource] / C. Lomont // Purdue University : Tech. Rep. – 2003. – Regime of access: http://www.lomont.org/Math/Papers/2003/InvSqrt.pdf.
- Blinn J. Floating-point tricks / J. Blinn // IEEE Comput. Graphics Appl. IEEE, 1997. Vol. 17. No. 4. P. 80–84.
- Zafar S. Hardware architecture design and mapping of “Fast Inverse Square Root’s algorithm” / S. Zafar, R. Adapa // International Conference on Advances in Electrical Engineering (ICAEE). 2014. P. 1–4.
- Martin P. Eight Rooty Pieces / P. Martin // Overload Journal. No. 135. 2016. P. 8–12.
- Fast calculation of inverse square root with the use of magic constant – analytical approach / L. Moroz, C. J. Walczyk, A. Hrynchyshyn, V. Holimath, J.L. Cieslinski // Appl. Math. Computation. Elsevier, 2018. Vol. 316.P. 245–255.
- Eberly D. H. GPGPU Programming for Games and Science / D. H. Eberly. Florida : CRC Press, 2015.
- Walczyk C. J. Improving the accuracy of the fast inverse square root algorithm [Electronic resource] / C. J. Walczyk, L. V. Moroz, J. L. Cieslinski. – arXiv preprint arXiv: 1802.06302. 2018 Regime of access: https://arxiv.org/pdf/1802.06302.pdf.