Parallel-Face-Recovery-Recognition

Parallel implementation of a nearest-neighbor algorithm (image matching) and a linear-interpolation algorithm (occlusion recovery), using MPI and CUDA.
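
The two algorithms named above can be sketched serially as follows. This is only an illustration under stated assumptions, not the code in face.c or face.cu: it assumes images are stored as flattened arrays of pixel values, that matching uses squared Euclidean distance, and that an occluded region is a run of pixels within one row filled from its two known neighbors. The function names `nearest_neighbor` and `interpolate_gap` are hypothetical.

```c
#include <assert.h>
#include <float.h>
#include <stddef.h>

/* Squared Euclidean distance between two flattened images (assumed metric). */
static double squared_distance(const double *a, const double *b, size_t n_pixels)
{
    double sum = 0.0;
    for (size_t i = 0; i < n_pixels; ++i) {
        double d = a[i] - b[i];
        sum += d * d;
    }
    return sum;
}

/* Returns the index of the training image closest to `query`.
   `train` holds n_train images of n_pixels values stored back to back. */
size_t nearest_neighbor(const double *train, size_t n_train,
                        const double *query, size_t n_pixels)
{
    assert(n_train > 0);
    size_t best = 0;
    double best_dist = DBL_MAX;
    for (size_t t = 0; t < n_train; ++t) {
        double d = squared_distance(train + t * n_pixels, query, n_pixels);
        if (d < best_dist) {
            best_dist = d;
            best = t;
        }
    }
    return best;
}

/* Fills the occluded pixels row[first..last] by linear interpolation between
   the known neighbors row[first - 1] and row[last + 1] (assumed layout). */
void interpolate_gap(double *row, size_t width, size_t first, size_t last)
{
    assert(first >= 1 && first <= last && last + 1 < width);
    double left = row[first - 1];
    double right = row[last + 1];
    double span = (double)(last - first + 2);
    for (size_t i = first; i <= last; ++i) {
        double t = (double)(i - first + 1) / span;
        row[i] = left + t * (right - left);
    }
}
```

In a parallel version, the loop over training images in `nearest_neighbor` is the natural candidate for distribution across MPI ranks or CUDA threads, since each distance is independent of the others.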

How to Run (ensure access to CUDA, MPI, and clockcycle.h; run.mk can also be used):

mpixlc -g face.c -c -o face-mpi.o
nvcc -g -G face.cu -c -o face-cuda.o
mpicc -g face-mpi.o face-cuda.o -o face-exe -L/usr/local/cuda-11.2/lib64/ -lcudadevrt -lcudart -lstdc++ -lm
mpirun --bind-to core --report-bindings -np 1 face-exe 1 1 1024

The first argument selects the training input file (1 selects the training file with 1*360 images, faces_train360x1.csv).

The second argument selects the testing input file (1 selects the x1 testing file, faces_testx1.csv).

The third argument sets the number of CUDA threads per block.
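
A threads-per-block value is normally paired with a block count computed by ceiling division. The sketch below shows that arithmetic on the host side; it assumes one thread per work item (for example, one per training image), which may not be how face.cu divides the work. `blocks_needed` and `MAX_THREADS_PER_BLOCK` are hypothetical names.

```c
#include <assert.h>
#include <stddef.h>

/* Per-block thread limit on current NVIDIA GPUs. */
#define MAX_THREADS_PER_BLOCK 1024

/* Smallest block count such that blocks * threads_per_block >= n_items. */
size_t blocks_needed(size_t n_items, size_t threads_per_block)
{
    assert(threads_per_block >= 1 && threads_per_block <= MAX_THREADS_PER_BLOCK);
    return (n_items + threads_per_block - 1) / threads_per_block;
}
```

For 360 work items, `blocks_needed(360, 1024)` gives 1 block and `blocks_needed(360, 32)` gives 12. The last block is usually only partly full, so a kernel launched this way needs a bounds check on the thread index.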

Run instructions also appear as comments in face-serial.c, face-mpi.c, and face.c.

Test Cases:

Sequential:

Train 1, Test 1
Train 2, Test 2
Train 4, Test 4
Train 8, Test 8
Train 16, Test 16

Strong Scaling Study 1 (Train 16, Test 16, No GPU, face-mpi.c):

Ranks 1
Ranks 2
Ranks 4
Ranks 8
Ranks 12
Ranks 24
Ranks 36

Strong Scaling Study 2 (Train 16, Test 16, 1 GPU):

Ranks 1, Blocksize 1
Ranks 2, Blocksize 8
Ranks 4, Blocksize 16
Ranks 8, Blocksize 32
Ranks 12, Blocksize 128
Ranks 24, Blocksize 512
Ranks 36, Blocksize 1024

Weak Scaling Study (Blocksize 1024):

1 GPU/Rank, Train 1, Test 1
2 GPU/Rank, Train 2, Test 2
3 GPU/Rank, Train 3, Test 3
4 GPU/Rank, Train 4, Test 4
5 GPU/Rank, Train 5, Test 5
6 GPU/Rank, Train 6, Test 6
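
Timings from scaling studies like those above are commonly summarized as speedup and efficiency. A minimal sketch, assuming the standard textbook definitions; the project's own analysis may define these differently, and the function names are hypothetical.

```c
#include <assert.h>

/* Speedup: baseline runtime divided by parallel runtime. */
double speedup(double t_base, double t_parallel)
{
    assert(t_parallel > 0.0);
    return t_base / t_parallel;
}

/* Strong scaling efficiency: speedup divided by the number of ranks.
   The problem size is fixed while the rank count grows. */
double strong_efficiency(double t_base, double t_parallel, int ranks)
{
    assert(ranks > 0);
    return speedup(t_base, t_parallel) / (double)ranks;
}

/* Weak scaling efficiency: baseline runtime divided by the runtime of the
   scaled run, where problem size grows in proportion to the resources. */
double weak_efficiency(double t_base, double t_scaled)
{
    assert(t_scaled > 0.0);
    return t_base / t_scaled;
}
```

With made-up timings of 8 s on 1 rank and 2 s on 4 ranks, `speedup(8.0, 2.0)` is 4.0 and `strong_efficiency(8.0, 2.0, 4)` is 1.0, which would be ideal scaling.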
