fastest-matrices

This project benchmarks the following libraries:

hmatrix
dense-linear-algebra (referred to as DLA henceforth)
numhask
massiv ("Massiv (Par)" refers to parallel computation)
matrix

To run:

(allocation) stack build :bench-alloc && stack exec bench-alloc

(runtime) stack build :bench-runtime && stack exec bench-runtime

Results

Runtime

Matrix-matrix multiplication

Library	n = 10	n = 50	n = 100
DLA	2.65 us	289.0 us	2.24 ms
Hmatrix	1.32 us	55.8 us	292.0 us
NumHask	714.0 us	63.5 ms	593.0 ms
Massiv	12.0 us	205.0 us	1.52 ms
Massiv (Par)	76.1 us	220.0 us	866.0 us
Matrix	12.6 us	1.1 ms	8.44 ms
Naive C	51 us	323 us	4.78 ms

Repeated matrix-matrix multiplication

Library	n = 10	n = 50	n = 100
DLA	8.25 us	852.0 us	6.92 ms
Hmatrix	5.41 us	170.0 us	889.0 us
NumHask	1.46 ms	152.0 ms	1.42 s
Massiv	38.9 us	629.0 us	4.48 ms
Massiv (Par)	358.0 us	816.0 us	2.8 ms

Matrix-vector multiplication

Library	n = 10	n = 50	n = 100
DLA	302.0 ns	4.12 us	16.1 us
Hmatrix	706.0 ns	2.27 us	11.1 us

QR factorization

Library	n = 10	n = 50	n = 100
DLA	3.32 us	233.0 us	1.7 ms
Hmatrix	94.0 us	6.62 ms	60.3 ms

Transpose

Library	n = 10	n = 50	n = 100
DLA	330.0 ns	8.46 us	25.9 us
Hmatrix	24.9 ns	24.4 ns	17.6 ns
NumHask	309.0 ns	7.29 us	28.1 us
Massiv	7.29 us	35.7 us	122.0 us
Matrix	4.58 us	130.0 us	699.0 us

Norm

Library	n = 10	n = 50	n = 100
DLA	189.0 ns	4.15 us	16.8 us
Hmatrix	285.0 ns	1.15 us	4.32 us
NumHask	40.3 us	1.79 ms	9.72 ms
Massiv	128.0 ns	3.3 us	13.0 us
Naive C	350 ns	12.65 us	40.96 us

Row

Library	n = 10	n = 50	n = 100
DLA	26.4 ns	19.5 ns	19.5 ns
Hmatrix	1.43 us	1.63 us	1.7 us
NumHask	39.4 ns	170.0 ns	305.0 ns
Massiv	3.97 us	5.08 us	4.74 us
Matrix	40.9 ns	167.0 ns	310.0 ns

Column

Library	n = 10	n = 50	n = 100
DLA	61.0 ns	279.0 ns	295.0 ns
Hmatrix	1.43 us	1.7 us	1.84 us
NumHask	221.0 ns	1.04 us	2.38 us
Massiv	4.63 us	5.04 us	5.04 us
Matrix	350.0 ns	1.59 us	3.09 us

Identity

Library	n = 10	n = 50	n = 100
DLA	157.0 ns	4.75 us	11.2 us
Hmatrix	2.31 us	34.5 us	132.0 us
Matrix	2.94 us	65.9 us	492.0 us

Diagonal

Library	n = 10	n = 50	n = 100
DLA	124.0 ns	5.04 us	11.2 us
Hmatrix	2.15 us	33.7 us	132.0 us

Allocation

Matrix-matrix multiplication

Library	n = 10	n = 50	n = 100
DLA	976	20,176	80,176
hmatrix	904	20,936	80,936
NumHask	1,691,432	179,093,816	1,400,273,872
Massiv	5,816	140,216	560,216
Matrix	18,160	392,288	1,544,056

QR factorization

Library	n = 10	n = 50	n = 100
DLA	1,848	40,248	160,248
hmatrix	201,192	9,074,048	67,457,120

Transpose

Library	n = 10	n = 50	n = 100
DLA	880	20,080	80,080
hmatrix	64	64	64
NumHask	0	0	0
Massiv	872	20,072	80,072
Matrix	9,840	239,952	959,664

Norm

Library	n = 10	n = 50	n = 100
DLA	16	16	16
hmatrix	232	232	232
NumHask	146,800	2,919,552	11,641,752
Massiv	16	16	16

Row

Library	n = 10	n = 50	n = 100
DLA	64	64	64
hmatrix	2,128	2,128	2,128
NumHask	256	256	256
Massiv	144	464	864
Matrix	896	20,112	80,168

Column

Library	n = 10	n = 50	n = 100
DLA	160	480	880
hmatrix	2,128	2,128	2,128
NumHask	800	2,720	5,120
Matrix	1,648	23,744	87,400

Identity

Library	n = 10	n = 50	n = 100
DLA	1,008	20,528	80,928
hmatrix	3,208	66,440	252,440
Matrix	5,752	139,848	559,504

Relevant details:

the implementations of the "naive C" parts can be found in /naive. They were compiled with -O3
the massiv benchmarks use the Primitive representation, which seems to be the fastest among what massiv offers
the benchmarked functions from DLA are taken from the Fast module when available
the norm function is called on n*n vectors
instead of relying on Hackage, the project's dependencies fetch the libraries directly from GitHub (see stack.yaml)

Formerly included:

bed-and-breakfast (abandoned because too slow)
matrices (abandoned because too slow, also see kaizhang/matrices#8)

TODO:

have a cleaner/more abstract interface for the benches
