Search

Home

Sort by

Full Search
Definition
Symbol
File Path
History
Type

In Project(s)

Searched hist:79971 (Results 1 - 1 of 1) sorted by relevance

/haiku/src/system/libroot/posix/string/arch/x86_64/
H A D	arch_string.cpp	diff 4582b6e3 Wed Sep 10 14:09:57 MDT 2014 Paweł Dziepak <pdziepak@quarnos.org> libroot/x86_64: new memcpy implementation This patch introduces new memcpy() implementation that improves the performance when the buffer is small. It was written for processors that support ERMSB, but performs reasonably well on older CPUs as well. The following benchmarks were done on Haswell i7 running Debian Jessie with Linux 3.16.1. In each iteration 64MB buffer was copied, the parameter "size" is the size of the buffer passed in a single call (i.e. for "size: 2" memcpy() was called ~32 million times to copy the whole 64MB). f - original implementation, g - new implementation, all buffers 16 byte aligned cpy, size: 8, f: 79971 µs, g: 20419 µs, ∆: 74.47% cpy, size: 32, f: 42068 µs, g: 12159 µs, ∆: 71.10% cpy, size: 128, f: 13408 µs, g: 10359 µs, ∆: 22.74% cpy, size: 512, f: 10634 µs, g: 10433 µs, ∆: 1.89% cpy, size: 1024, f: 10474 µs, g: 10536 µs, ∆: -0.59% cpy, size: 4096, f: 9419 µs, g: 8630 µs, ∆: 8.38% f - glibc 2.19 implementation, g - new implementation, all buffers 16 byte aligned cpy, size: 8, f: 26299 µs, g: 20919 µs, ∆: 20.46% cpy, size: 32, f: 11146 µs, g: 12159 µs, ∆: -9.09% cpy, size: 128, f: 10778 µs, g: 10354 µs, ∆: 3.93% cpy, size: 512, f: 12291 µs, g: 10426 µs, ∆: 15.17% cpy, size: 1024, f: 13923 µs, g: 10571 µs, ∆: 24.08% cpy, size: 4096, f: 11770 µs, g: 8671 µs, ∆: 26.33% f - glibc 2.19 implementation, g - new implementation, all buffers unaligned cpy, size: 16, f: 13376 µs, g: 13009 µs, ∆: 2.74% cpy, size: 32, f: 11130 µs, g: 12171 µs, ∆: -9.35% cpy, size: 64, f: 11017 µs, g: 11231 µs, ∆: -1.94% cpy, size: 128, f: 10884 µs, g: 10407 µs, ∆: 4.38% cpy, size: 256, f: 10826 µs, g: 10106 µs, ∆: 6.65% cpy, size: 512, f: 12354 µs, g: 10396 µs, ∆: 15.85% Signed-off-by: Paweł Dziepak <pdziepak@quarnos.org> diff 4582b6e3a3c960362d4f9f2f31b7c43887226fa2 Wed Sep 10 14:09:57 MDT 2014 Paweł Dziepak <pdziepak@quarnos.org> libroot/x86_64: new memcpy implementation This patch introduces new memcpy() implementation that improves the performance when the buffer is small. It was written for processors that support ERMSB, but performs reasonably well on older CPUs as well. The following benchmarks were done on Haswell i7 running Debian Jessie with Linux 3.16.1. In each iteration 64MB buffer was copied, the parameter "size" is the size of the buffer passed in a single call (i.e. for "size: 2" memcpy() was called ~32 million times to copy the whole 64MB). f - original implementation, g - new implementation, all buffers 16 byte aligned cpy, size: 8, f: 79971 µs, g: 20419 µs, ∆: 74.47% cpy, size: 32, f: 42068 µs, g: 12159 µs, ∆: 71.10% cpy, size: 128, f: 13408 µs, g: 10359 µs, ∆: 22.74% cpy, size: 512, f: 10634 µs, g: 10433 µs, ∆: 1.89% cpy, size: 1024, f: 10474 µs, g: 10536 µs, ∆: -0.59% cpy, size: 4096, f: 9419 µs, g: 8630 µs, ∆: 8.38% f - glibc 2.19 implementation, g - new implementation, all buffers 16 byte aligned cpy, size: 8, f: 26299 µs, g: 20919 µs, ∆: 20.46% cpy, size: 32, f: 11146 µs, g: 12159 µs, ∆: -9.09% cpy, size: 128, f: 10778 µs, g: 10354 µs, ∆: 3.93% cpy, size: 512, f: 12291 µs, g: 10426 µs, ∆: 15.17% cpy, size: 1024, f: 13923 µs, g: 10571 µs, ∆: 24.08% cpy, size: 4096, f: 11770 µs, g: 8671 µs, ∆: 26.33% f - glibc 2.19 implementation, g - new implementation, all buffers unaligned cpy, size: 16, f: 13376 µs, g: 13009 µs, ∆: 2.74% cpy, size: 32, f: 11130 µs, g: 12171 µs, ∆: -9.35% cpy, size: 64, f: 11017 µs, g: 11231 µs, ∆: -1.94% cpy, size: 128, f: 10884 µs, g: 10407 µs, ∆: 4.38% cpy, size: 256, f: 10826 µs, g: 10106 µs, ∆: 6.65% cpy, size: 512, f: 12354 µs, g: 10396 µs, ∆: 15.85% Signed-off-by: Paweł Dziepak <pdziepak@quarnos.org>

Completed in 18 milliseconds

asus-wl-520gu-7.0.1.45
asuswrt-rt-n18u-9.0.0.4.380.2695
barrelfish-2018-10-04
barrelfish-master
broadcom-cfe-1.4.2
darwin-on-arm
freebsd-10-stable
freebsd-10.0-release
freebsd-10.1-release
freebsd-10.2-release
freebsd-10.3-release
freebsd-11-stable
freebsd-11.0-release
freebsd-12-stable
freebsd-13-stable
freebsd-9.3-release
freebsd-current
fuchsia
haiku
haiku-buildtools
haiku-fatelf
haikuporter
haikuports
linux-master
macosx-10.10
macosx-10.10.1
macosx-10.5.8
macosx-10.9.5
netbsd-6-1-5-RELEASE
netbsd-current
netgear-R7000-V1.0.7.12_1.2.5
netgear-R7800-V1.0.2.28
netgear-WNDR4500-V1.0.1.40_1.0.68
netgear-WNDR4500v2-V1.0.0.60_1.0.38
openbsd-current
openjdk10
openjdk9
opensolaris-onvv-gate
openwrt
seL4-camkes-master
seL4-l4v-10.1.1
seL4-l4v-master
seL4-mcs-10.1.1
seL4-refos-master
seL4-test-master
u-boot
xnu-2422.115.4
xnu-2782.1.97