# sum-parallel

summarize numbers from a text file using multiple CPU cores

## status

the current [sum-parallel.c](sum-parallel.c) is slower than [sum.c](sum.c)

```
seq 10000000 >input.txt
gcc -o sum sum.c && time ./sum <input.txt
```

adding two numbers takes a single CPU cycle, memory reads take about [100 CPU cycles](http://www.eecs.berkeley.edu/~rcs/research/interactive_latency.html) on 2016 hardware.

```console
$ pv -a input.txt >/dev/null
[1.52GiB/s]

$ pv -a input.txt | ./sum
[15.4MiB/s]
50000005000000
```
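
the raw file can be read at 1.52GiB/s but summing it runs at only 15.4MiB/s, which suggests the time goes into parsing the numbers, not into reading the file. for reference, a single-core summer could look roughly like this (a sketch only, not necessarily the exact code in [sum.c](sum.c)):

```c
// sketch of a single-core summer, assuming one decimal number per line on stdin.
// this is not necessarily the repo's sum.c, just the shape of the baseline.
#include <stdio.h>

int main(void) {
    long long sum = 0;  // 64-bit accumulator, the expected total overflows 32 bits
    int n;
    // fscanf skips whitespace (including newlines) and returns the number of
    // items assigned, so the loop stops at end of input
    while (fscanf(stdin, "%d", &n) == 1) {
        sum += n;
    }
    printf("%lld\n", sum);
    return 0;
}
```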

## read multiple numbers with fscanf

currently, i read only one number per fscanf call

```c
fscanf(fptr, "%d", &n)
```

but fscanf can read multiple values

```c
fscanf(fptr, "%d\n%d\n%d\n%d", &n1, &n2, &n3, &n4)
```
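
if batching were used, the return value of fscanf (the number of items assigned) would have to drive the loop, so a short last batch at the end of the file is not lost. a rough sketch, with hypothetical variable names:

```c
// hypothetical batched read loop: k is how many of the 4 slots were filled,
// so a partial last batch at end of file is still counted
int n1, n2, n3, n4;
int k;
while ((k = fscanf(fptr, "%d %d %d %d", &n1, &n2, &n3, &n4)) > 0) {
    if (k >= 1) sum += n1;
    if (k >= 2) sum += n2;
    if (k >= 3) sum += n3;
    if (k >= 4) sum += n4;
    if (k < 4) break;  // short batch means end of input
}
```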

but no, batching should not gain much, because fscanf is already buffered

https://stackoverflow.com/a/9587245/10440128

> When you use fread or the other file I/O functions in the C standard library, memory is buffered in several places.

> The C library will usually create a buffer for every FILE* you have open. Data is read into these buffers in large chunks. This allows fread to satisfy many small requests without having to make a large number of system calls, which are expensive. This is what people mean when they say fread is buffered.

> The kernel will also buffer files that are being read in the disk cache. This reduces the time needed for the read system call, since if data is already in memory, your program won't have to wait while the kernel fetches it from the disk. The kernel will hold on to recently read files, and it may read ahead for files which are being accessed sequentially.
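
one way to check this (assuming strace is available) is to count the read syscalls; a buffered reader should issue a few large reads instead of one read per number:

```console
$ strace -c -e trace=read ./sum <input.txt
```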

## run multiple sum in parallel

this runs serially

```console
$ time { n=4; for i in $(seq $n); do seq $((1 + 10000000 / n * (i - 1))) $((10000000 / n * i)) | ./sum; done | ./sum; }
50000005000000

real 0m5.251s
user 0m5.850s
sys 0m0.442s
```

this runs in parallel

```console
$ time { n=2; t=$(mktemp); p=; for i in $(seq $n); do seq $((1 + 10000000 / n * (i - 1))) $((10000000 / n * i)) | ./sum >>$t & p+=" $!"; done; wait $p; ./sum <$t; }
50000005000000

real 0m3.190s
user 0m6.549s
sys 0m0.577s

$ time { n=4; t=$(mktemp); p=; for i in $(seq $n); do seq $((1 + 10000000 / n * (i - 1))) $((10000000 / n * i)) | ./sum >>$t & p+=" $!"; done; wait $p; ./sum <$t; }
50000005000000

real 0m3.338s
user 0m9.707s
sys 0m0.874s
```

success, this is about 1.6x faster than serial (3.2s vs 5.3s wall time)

`n=4` is not faster, because i have only 2 cpu cores

```console
$ grep -m1 "^cpu cores" /proc/cpuinfo
cpu cores : 2
```
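
the same split could be done inside one process over the input file: divide the file into byte ranges, let each thread sum its range, and add the partial sums. a rough pthreads sketch follows; it is not the repo's [sum-parallel.c](sum-parallel.c), it assumes one decimal number per line and unix newlines, hard-codes the thread count, and is built with `gcc -pthread`:

```c
// pthreads sketch of the split-and-sum idea, NOT the repo's sum-parallel.c.
// assumptions: one decimal number per line, unix newlines, path given as argv[1].
#include <pthread.h>
#include <stdio.h>

#define NTHREADS 2  // assumption: match the number of physical cores

struct job {
    const char *path;
    long start;      // first byte of this thread's range
    long end;        // first byte past this thread's range
    long long sum;   // partial result
};

static void *worker(void *arg) {
    struct job *j = arg;
    FILE *f = fopen(j->path, "r");
    if (!f) return NULL;
    if (j->start > 0) {
        // a line straddling the boundary belongs to the previous thread:
        // step back one byte and skip forward past the next newline
        fseek(f, j->start - 1, SEEK_SET);
        int c;
        while ((c = fgetc(f)) != EOF && c != '\n') {}
    }
    long long n;
    // only parse numbers whose first byte lies inside [start, end)
    while (ftell(f) < j->end && fscanf(f, "%lld", &n) == 1) {
        j->sum += n;
        fgetc(f);  // consume the newline so ftell points at the next line
    }
    fclose(f);
    return NULL;
}

int main(int argc, char **argv) {
    if (argc != 2) { fprintf(stderr, "usage: %s input.txt\n", argv[0]); return 1; }
    FILE *f = fopen(argv[1], "r");
    if (!f) { perror("fopen"); return 1; }
    fseek(f, 0, SEEK_END);
    long size = ftell(f);
    fclose(f);

    pthread_t tid[NTHREADS];
    struct job jobs[NTHREADS];
    for (int i = 0; i < NTHREADS; i++) {
        jobs[i] = (struct job){ argv[1], size * i / NTHREADS,
                                size * (i + 1) / NTHREADS, 0 };
        pthread_create(&tid[i], NULL, worker, &jobs[i]);
    }
    long long total = 0;
    for (int i = 0; i < NTHREADS; i++) {
        pthread_join(tid[i], NULL);
        total += jobs[i].sum;
    }
    printf("%lld\n", total);
    return 0;
}
```

the only subtle part is the range boundary: a line that straddles a boundary is summed by the thread whose range contains the line's first byte, so nothing is counted twice or dropped.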

## see also

- [Shell command to sum integers, one per line?](https://stackoverflow.com/questions/450799/shell-command-to-sum-integers-one-per-line)
- [Add up a column of numbers at the Unix shell](https://stackoverflow.com/questions/926069/add-up-a-column-of-numbers-at-the-unix-shell)