Commit 477d18b1, authored Nov 13, 2018 by William Stonewall Monroe
Update README.md
parent 171b915e
README.md
# horovod-environment
A yml file and a set of instructions for building a functioning Horovod environment for distributed learning with Keras and TensorFlow (and Torch) on Cheaha.
# load modules
module load Anaconda3/5.2.0
module load cuda91
module load OpenMPI/3.1.2-gcccuda-2018b
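
Before building the environment, it can help to confirm that the loaded modules actually put the expected tools on `PATH`. The following is a minimal sketch and not part of this repo: `find_tools` is a hypothetical helper, and the executable names it checks (`conda`, `nvcc`, `mpirun`) are assumptions about what the three modules provide.

```python
import shutil


def find_tools(names):
    """Map each executable name to its full path, or None if it is not on PATH."""
    return {name: shutil.which(name) for name in names}


if __name__ == "__main__":
    # Assumed executables: conda (Anaconda3), nvcc (cuda91), mpirun (OpenMPI).
    for name, path in find_tools(["conda", "nvcc", "mpirun"]).items():
        print(f"{name}: {path or 'NOT FOUND, check module load'}")
```

If any entry is reported as not found, re-running the corresponding `module load` line is the first thing to try.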
# create anaconda environment
Download distribLearn2.yml from this repo
conda env create -f distribLearn2.yml --name distributedLearning
source activate distributedLearning
conda update automat
pip uninstall horovod
pip install --no-cache-dir horovod
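
After reinstalling horovod, a quick check that the key packages can be found inside the activated environment may save a failed multi-node launch later. This is a sketch under stated assumptions: `check_packages` is a hypothetical helper, and the package list is inferred from the frameworks named in this README rather than read from distribLearn2.yml. It only checks that each package can be located; it does not import it or test MPI or GPU support.

```python
import importlib.util


def check_packages(names):
    """Return {package name: True/False} for whether each top-level package can be found."""
    return {name: importlib.util.find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Package list is an assumption based on the frameworks named in this README.
    for name, found in check_packages(["horovod", "tensorflow", "keras", "torch"]).items():
        print(f"{name}: {'ok' if found else 'missing'}")
```

Run it with the `distributedLearning` environment active so that the environment's own `python` is the one being checked.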
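# navigate to an example
The Horovod examples can be downloaded from https://github.com/uber/horovod. In the commands below, replace blazerid with your own BlazerID, and host1 and host2 with the names of the nodes in your job.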
cd /data/user/blazerid/horovod-master/examples/
mpirun -np 8 -H host1:4,host2:4 -bind-to none -map-by slot -mca pml ob1 -mca btl_tcp_if_include ib0 python keras_mnist.py
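
In the `mpirun` line above, `-np 8` is the total number of processes and `-H host1:4,host2:4` assigns four of them to each host, so the per-host counts should add up to `-np`. The sketch below builds the same command from a host-to-process mapping so the two values cannot drift apart. `build_mpirun_command` is a hypothetical helper, not something provided by Horovod or Open MPI, and `host1`/`host2` remain placeholders.

```python
import shlex


def build_mpirun_command(hosts, script, interface="ib0"):
    """Build an mpirun argument list; hosts maps host name -> number of processes."""
    total = sum(hosts.values())  # -np must equal the sum of per-host counts
    host_list = ",".join(f"{name}:{slots}" for name, slots in hosts.items())
    return [
        "mpirun", "-np", str(total), "-H", host_list,
        "-bind-to", "none", "-map-by", "slot",
        "-mca", "pml", "ob1",
        "-mca", "btl_tcp_if_include", interface,
        "python", script,
    ]


if __name__ == "__main__":
    # Placeholder host names; substitute the nodes allocated to your job.
    cmd = build_mpirun_command({"host1": 4, "host2": 4}, "keras_mnist.py")
    print(" ".join(shlex.quote(arg) for arg in cmd))
```

The returned list can be passed directly to `subprocess.run` without going through a shell, which avoids quoting problems if a script path contains spaces.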