datascience_docker

Docker settings (Dockerfile etc.) for data science

The default port exposed for Jupyter/PySpark access is 8123.

starterpack.sh

A Bash script to run after initializing a cloud instance (AWS EC2, GCP Compute Engine, etc.). It installs Docker and sets up a few useful aliases.

To run:
- make the script executable: sudo chmod 774 starterpack.sh
- run the script: ./starterpack.sh
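
For readers unsure what mode 774 grants, the sketch below decodes a three-digit octal mode into the familiar rwx notation. `mode_to_rwx` is a hypothetical helper written only for illustration; it is not part of starterpack.sh.

```shell
# Decode a three-digit octal mode (e.g. 774) into rwx notation.
# mode_to_rwx is a hypothetical helper, not part of starterpack.sh.
mode_to_rwx() {
    out=""
    # Split the mode into single digits: "774" -> "7 7 4"
    for digit in $(printf '%s' "$1" | sed 's/./& /g'); do
        case $digit in
            0) out="${out}---" ;;
            1) out="${out}--x" ;;
            2) out="${out}-w-" ;;
            3) out="${out}-wx" ;;
            4) out="${out}r--" ;;
            5) out="${out}r-x" ;;
            6) out="${out}rw-" ;;
            7) out="${out}rwx" ;;
            *) echo "invalid octal digit: $digit" >&2; return 1 ;;
        esac
    done
    printf '%s\n' "$out"
}

mode_to_rwx 774   # prints rwxrwxr--
```

In other words, 774 lets the owner and group read, write, and execute the script, while everyone else can only read it.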

Latest images

etture/ybigta_img_python:1.2: basic Python image with Anaconda
etture/ybigta_img_hadoop:1.2: image with Hadoop
etture/ybigta_img_spark:1.2.4: image with Spark

etture/ybigta_img_python:1.2

Size: 4.19 GB
Pull: sudo docker pull etture/ybigta_img_python:1.2
Run: sudo docker run -d -it -p 8123:8123 --name=ybigta-python etture/ybigta_img_python:1.2 /bin/bash
Exec: sudo docker exec -it ybigta-python /bin/bash
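
Once inside the container, a Jupyter server has to listen on the published port (8123) to be reachable from the host. The sketch below prints one plausible launch command. It assumes that Jupyter Notebook is available in the image (it normally ships with Anaconda) and that the container shell runs as root; `jupyter_cmd` is a hypothetical helper, not something the image provides.

```shell
# Hypothetical helper: print a Jupyter launch command for the given port.
# Assumes Jupyter Notebook is installed in the image and the shell runs as root.
jupyter_cmd() {
    port=${1:-8123}   # default matches the port published with -p 8123:8123
    printf 'jupyter notebook --ip=0.0.0.0 --port=%s --no-browser --allow-root\n' "$port"
}

jupyter_cmd   # prints the command for port 8123

# To actually start the server inside the container:
#   eval "$(jupyter_cmd)"
```

Binding to 0.0.0.0 rather than localhost is what lets the port mapping forward traffic from the host into the container.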

etture/ybigta_img_crawling:1.1

Size: 5.14 GB
Pull: sudo docker pull etture/ybigta_img_crawling:1.1
Run: sudo docker run -d -it -p 8123:8123 --name=ybigta-crawling etture/ybigta_img_crawling:1.1 /bin/bash
Exec: sudo docker exec -it ybigta-crawling /bin/bash

etture/ybigta_img_hadoop:1.2

Size: 6.68 GB
Pull: sudo docker pull etture/ybigta_img_hadoop:1.2
Run: sudo docker run -d -it -p 8123:8123 --name=ybigta-hadoop etture/ybigta_img_hadoop:1.2 /bin/bash
Exec: sudo docker exec -it ybigta-hadoop /bin/bash

etture/ybigta_img_spark:1.2.4

Size: 7.67 GB
Pull: sudo docker pull etture/ybigta_img_spark:1.2.4
Run: sudo docker run -d -it -p 8123:8123 -p 4040:4040 --name=ybigta-spark etture/ybigta_img_spark:1.2.4 /bin/bash
Exec: sudo docker exec -it ybigta-spark /bin/bash
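
The four images above follow the same pull/run/exec pattern, differing only in name, tag, and (for Spark) an extra published port. As a rough sketch, the commands could be generated with a small function. `docker_cmds` is a hypothetical helper; it only prints the commands (a dry run) and does not call Docker.

```shell
# Print the pull/run/exec commands for one image (dry run; nothing is executed).
# docker_cmds is a hypothetical helper; names and tags come from the sections above.
docker_cmds() {
    name=$1          # short image name, e.g. python
    tag=$2           # image tag, e.g. 1.2
    extra=${3:-}     # optional extra port flags, e.g. "-p 4040:4040"
    image="etture/ybigta_img_${name}:${tag}"
    ports="-p 8123:8123"
    if [ -n "$extra" ]; then
        ports="$ports $extra"
    fi
    echo "sudo docker pull $image"
    echo "sudo docker run -d -it $ports --name=ybigta-$name $image /bin/bash"
    echo "sudo docker exec -it ybigta-$name /bin/bash"
}

docker_cmds python 1.2
docker_cmds crawling 1.1
docker_cmds hadoop 1.2
docker_cmds spark 1.2.4 "-p 4040:4040"   # Spark also publishes port 4040
```

Piping the output to `sh` would run the commands, but reviewing the printed lines first is the safer habit.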
