I'm diving head first into MPI and related parallel programming because:
1. It's fun and I've always wanted to do it!
2. I need it for my summer project at IISc, which would otherwise take months on single- or dual-processor machines.
I was surprised when Hollow told me that you did not have the full MPI manual (MIT Press).
I am somewhat aware of the problems you ran into while setting up the Beowulf cluster. I will try to find solutions for these, since there are people here who are proficient at this sort of stuff.
In any case, I will be adding notes from the reference manual that I feel are important under this tag. I'll also post some bioinformatics problems that can be tackled once the cluster is up and running on all its feet.
2 comments:
yo baby!!! thanks a lot! really haven't seen this one
Hi!
I have been trying to build a cluster of 3 PCs. My master is ajitup. When I run mpd on nodes 1 and 2, I get the following error on those nodes:
----------------------
mpd -h ajitup -p 7573 &
[1] 20720
ajitup@plasma:~> plasma_29327: conn error in connect_rhs: Connection refused
plasma_29327 (connect_rhs 961): failed to connect to rhs at 127.0.0.1 7573
plasma_29327 (enter_ring 868): rhs connect failed
plasma_29327 (run 252): failed to enter ring
----------------------
The following error appears on the master at the same time:
ajitup_7573 (handle_rhs_input 1093): lost rhs; re-entering ring
ajitup_7573 (reenter_ring 825): reenter_ring rc=0 after numTries=1
ajitup_7573 (handle_rhs_input 1098): back in ring
ajitup_7573 (handle_rhs_input 1093): lost rhs; re-entering ring
ajitup_7573 (reenter_ring 825): reenter_ring rc=0 after numTries=1
ajitup_7573 (handle_rhs_input 1098): back in ring
COULD YOU PLEASE HELP ???
MY EMAIL:ajitup73@gmail.com, ajitup@cat.ernet.in
THANKS!