Moving to floats from doubles?

Gamestudio Links

Zorro Links

Newest Posts

Help with plotting multiple ZigZag
by degenerate_762. 04/30/24 23:23

M1 Oversampling
by 11honza11. 04/30/24 08:16

Texture/Sprite animation tool. Someone interested in ?
by NeoDumont. 04/29/24 02:20

Trading Journey
by howardR. 04/28/24 09:55

Zorro Trader GPT
by TipmyPip. 04/27/24 13:50

Data from CSV not parsed correctly
by jcl. 04/26/24 11:18

Why Zorro supports up to 72 cores?
by jcl. 04/26/24 11:09

Eigenwerbung
by jcl. 04/26/24 11:08

AUM Magazine

Latest Screens

Who's Online Now

2 registered members (degenerate_762, AndrewAMD), 877 guests, and 5 spiders.

Key: Admin, Global Mod, Mod

Newest Members

wandaluciaia, Mega_Rod, EternallyCurious, howardR, 11honza11
19049 Registered Users

Print Thread

Rate Thread

Moving to floats from doubles? #486082 06/08/22 11:45 06/08/22 11:45
Joined: Jul 2017 Posts: 784 Z Zheka OP User
Zheka OP User Z Joined: Jul 2017 Posts: 784	I would like to suggest to move Zorro's default 'var' from type double to type float. Most trading system calculations involve looping over N prior values to do some sort of summing/dividing/multiplying. With floats, 2X as many values will be prefetched to the CPU L1 cache, there will be fewer cache misses, and with c++ compilers auto-vectorization performance with floats on loops can be nearly 2X than with doubles. Division, sqrt, pow, exp, log, trigonometric functions can be 2x faster with floats; TA_Lib also has a version that consumes float arrays. I did some measurements several years ago - and the speed up was quite significant. I understand type conversion for intermediate calculations can be costly, but this will occasional and a less common case than looping over N prices/values. Float max value is big enough for overflow to occur very rarely, if ever. What do you think?

Re: Moving to floats from doubles? [Re: Zheka] #486086 06/08/22 13:20 06/08/22 13:20
Joined: Feb 2017 Posts: 1,725 Chicago AndrewAMD Serious User
AndrewAMD Serious User Joined: Feb 2017 Posts: 1,725 Chicago	To change var from double to float has two other problems: 1) It would break all kinds of script code. (A big no-no.) 2) Precision loss. Maybe better to add float support as a standalone feature? Like fseries() for float, typedef float fvar, and typedef float* fvars, where the 'f' is for "fast"? And then write the indicator functions to have both var and fvar overloads. This simplifies scripting. www.vitaltrades.com - andrew@vitaltrades.com

Re: Moving to floats from doubles? [Re: Zheka] #486090 06/08/22 14:49 06/08/22 14:49
Joined: Jul 2000 Posts: 27,986 Frankfurt jcl Chief Engineer
jcl Chief Engineer Joined: Jul 2000 Posts: 27,986 Frankfurt	https://articles.emptycrate.com/2012/02/11/double_vs_float_which_is_better.html

Re: Moving to floats from doubles? [Re: Zheka] #486094 06/08/22 18:42 06/08/22 18:42
Joined: Jul 2017 Posts: 784 Z Zheka OP User
Zheka OP User Z Joined: Jul 2017 Posts: 784	Jason's article is not based on a realistic use case. The 1st comment below confirms my experience: looping over a vector is 2X+ faster with floats vs doubles (and compilers/autovectorization have improved since) I would trade 2X performance difference for possible loss of precision any time. Given noisiness of financial time series, 'precision' is not meaningful that much (unlike with rocket trajectory calculations). But performance gain will not only be achieved in loops: Zorro's structures will be much more compact, leading to fewer cache misses and significantly better performance. + Many operations - division, sqrt, etc - will be 2x faster. With templating, most functions called from a user c++ code can work without changes to the syntax. How best to handle it for lite-c api - I don't know.

Re: Moving to floats from doubles? [Re: Zheka] #486098
06/08/22 20:51 06/08/22 20:51

Joined: Feb 2017
Posts: 1,725
Chicago

AndrewAMD

Serious User

AndrewAMD

Serious User

Joined: Feb 2017
Posts: 1,725
Chicago

Here's an actual *.cpp Zorro script. float is not any faster.

Code

#include <chrono>
#include <typeinfo>
#include <vector>
#include <zorro.h>

template<typename T> void sum_test1(int num_times, T value)
{
	T val = 0;

	std::chrono::high_resolution_clock::time_point t1 = std::chrono::high_resolution_clock::now();
	for (int i = 0; i < num_times; ++i)
	{
		val += value;
		if (!wait(-100000)) {
			printf("\nAborted!");
			return;
		}
	}
	auto dur = std::chrono::high_resolution_clock::now() - t1;
	auto dur_ms = std::chrono::duration_cast<std::chrono::milliseconds>(dur);
	printf("\nType name: %s", typeid(T).name());
	printf("\n... Size in bytes : %d", sizeof(T));
	printf("\n... Summation time: %d ms", dur_ms);
	printf("\n... Summed value: %0.8f", (double)val);
}

template<typename T> void vec_test2(size_t vecSize, int num_times, T value)
{
	std::vector<T> vec;
	vec.resize(vecSize);
	std::chrono::high_resolution_clock::time_point t1 = std::chrono::high_resolution_clock::now();
	for (int i = 0; i < num_times; ++i)
	{
		for (auto& val : vec) {
			val *= value;
			if (!wait(-100000)) {
				printf("\nAborted!");
				return;
			}
		}
	}
	auto dur = std::chrono::high_resolution_clock::now() - t1;
	auto dur_ms = std::chrono::duration_cast<std::chrono::milliseconds>(dur);
	printf("\nType name: %s", typeid(T).name());
	printf("\n... Size in bytes : %d", sizeof(T));
	printf("\n... Vector lenth: %d", vecSize);
	printf("\n... num_times: %d", num_times);
	printf("\n... Vector time: %d ms", dur_ms);
}


DLLFUNC void run() {
	set(LOGFILE);
	sum_test1(100000000, 3.3f);
	sum_test1(100000000, 3.3f);
	sum_test1(100000000, 3.3f);
	sum_test1(100000000, 3.3);
	sum_test1(100000000, 3.3);
	sum_test1(100000000, 3.3);
	vec_test2(3000000,30, 3.3f);
	vec_test2(3000000,30, 3.3f);
	vec_test2(3000000,30, 3.3f);
	vec_test2(3000000, 30, 3.3);
	vec_test2(3000000, 30, 3.3);
	vec_test2(3000000, 30, 3.3);

	quit("!Done!");
}

/*  LOG OUTPUT:

Type name: float
... Size in bytes : 4
... Summation time: 2247 ms
... Summed value: 67108864.00000000
Type name: float
... Size in bytes : 4
... Summation time: 2121 ms
... Summed value: 67108864.00000000
Type name: float
... Size in bytes : 4
... Summation time: 2079 ms
... Summed value: 67108864.00000000
Type name: double
... Size in bytes : 8
... Summation time: 2133 ms
... Summed value: 330000000.62168288
Type name: double
... Size in bytes : 8
... Summation time: 2164 ms
... Summed value: 330000000.62168288
Type name: double
... Size in bytes : 8
... Summation time: 2100 ms
... Summed value: 330000000.62168288
Type name: float
... Size in bytes : 4
... Vector lenth: 3000000
... num_times: 30
... Vector time: 1849 ms
Type name: float
... Size in bytes : 4
... Vector lenth: 3000000
... num_times: 30
... Vector time: 1816 ms
Type name: float
... Size in bytes : 4
... Vector lenth: 3000000
... num_times: 30
... Vector time: 1822 ms
Type name: double
... Size in bytes : 8
... Vector lenth: 3000000
... num_times: 30
... Vector time: 1829 ms
Type name: double
... Size in bytes : 8
... Vector lenth: 3000000
... num_times: 30
... Vector time: 1851 ms
Type name: double
... Size in bytes : 8
... Vector lenth: 3000000
... num_times: 30
... Vector time: 1879 ms
Done!

*/

www.vitaltrades.com - andrew@vitaltrades.com

Re: Moving to floats from doubles? [Re: Zheka] #486101 06/09/22 01:14 06/09/22 01:14
Joined: Feb 2017 Posts: 1,725 Chicago AndrewAMD Serious User
AndrewAMD Serious User Joined: Feb 2017 Posts: 1,725 Chicago	Interestingly, the speeds were slightly improved in 64-bit Zorro (about 14% faster overall). Nonetheless, no advantage to float over double. www.vitaltrades.com - andrew@vitaltrades.com

Re: Moving to floats from doubles? [Re: Zheka] #486104 06/09/22 09:01 06/09/22 09:01
Joined: Jul 2000 Posts: 27,986 Frankfurt jcl Chief Engineer
jcl Chief Engineer Joined: Jul 2000 Posts: 27,986 Frankfurt	In the Zorro structs you can see that they use both float and double. They use double where speed matters, and float where memory footprint or file size matters. Internal calculations always use double. For using external libraries such as ta-lib, there is no choice anyway. No serious finance library has floats nowadays. They all expect data in double arrays.

Moderated by Petra