PyCudaIntro

This short talk is about the idea of numerical optimisation in python, using GMMs as an illuminating example and CUDA as the main solution. This introduces the basic concepts, terminology and syntax used in CUDA, as well as testing validity and time calculations.

This doesn't look into CUDA too deeply (for fears of it gazing back :P), or how PyCUDA works, but rather should be enough for you to get basic CUDA acceleration working for your Python Code.

This computational aspect was part of a thesis into emotion recognition using Markov Chain Monte Carlo. (Hence the 13 dimensional data (MFCCs) and 1 million likelihood evaluations). Code has had a significant facelift since being first written.

The equation being optimised is the log likelihood on slide 4.

Presented at July's Sydney Python meet.

Source

likelihood/
- simple.py - an inefficient pure python/numpy implementation written by me before realising computational time was a thing
- scikitLL.py - an implementation using scikit-learn's GMM code. Originally used the sklearn.mixture.GMM class but this has been deprecated and significantly sped up with the sklearn.mixture.gaussian_mixture.GaussianMixture class. Uses scipy and I suspect compiled fortran/C subroutines under the hood
- cudaLL.py - an implementation using PyCUDA and the my kernel as in kernel.cu. The heavy lifting is in the .cu file and the python file is just setting everything up.
- base.py - the object to mock/subclass for interchangeable use
- tests.py - some quick checks
timeRunner.py - runs the various implenetations with various powers of 10. Use --method to choose the method to check
testValidity - a pytest file to compare the results of the output using randomized input

Talk

Includes the .tex file and the .pdf file as compiled. Not a great complete reference, but a decent starting point.

Links

PyCUDA - the package used to abstract away the CUDA interface. Documentation is good enough, though a bit sparse and code itself doesn't have much documentation, which makes advanced usage tricky. Has a small section on metaprogramming CUDA too which is very interesting.
CUDA Intro from NVIDIA - very good intro to CUDA using pure C++ and full of useful links to more advanced understanding.
scikit-learn - a package with implentations of ML and used for their efficient code. Specific page on GMMs gives more background. Code is very well documented allowing relatively easy use in other applications.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
Talk		Talk
source		source
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

PyCudaIntro

Source

Talk

Links

About

Uh oh!

Releases

Packages

Languages

License

nayyarv/PyCudaIntro

Folders and files

Latest commit

History

Repository files navigation

PyCudaIntro

Source

Talk

Links

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages