{{toc}}

h1. Install the platform without MAGMA

The COMPASS platform is distributed as a single bundle of the CArMA and SuTrA libraries together with YoGA and its AO extension for Yorick.

h2. Hardware requirements

The system must contain at least an x86 CPU and a CUDA-capable GPU. A list of compatible GPUs can be found here: http://www.nvidia.com/object/cuda_gpus.html. Specific requirements apply to clusters (to be updated).

h2. Environment requirements

The system must run a 64-bit distribution of Linux or Mac OS with the latest NVIDIA drivers and "CUDA toolkit":https://developer.nvidia.com/cuda-downloads. The corresponding version of the "CULA tools":http://www.culatools.com/downloads/dense/ is also required. The following installation instructions are valid if the default installation paths have been selected for these components.

Additionally, to benefit from the user-oriented features of the platform, Yorick should be installed, as well as the latest version of Python and the associated pygtk module.

To install Yorick, download the latest version from the github repository:
<pre>
git clone https://github.com/dhmunro/yorick.git yorick.git
</pre>
then cd into the created directory and install:
<pre>
./configure && make && make install
</pre>
Once Yorick is locally installed, add the directory yorick.git/relocate/bin to your PATH for easy access to the yorick executable. You may want to add support for command history by using rlwrap and aliasing the yorick executable as:
<pre>
alias yorick='rlwrap path_to_yorick_executable/yorick'
</pre>

h2. Installation process

First check out the latest version from the svn repository:
<pre>
svn co https://version-lesia.obspm.fr/repos/compass compass
</pre>
then go into the newly created directory and then into trunk:
<pre>
cd compass/trunk
</pre>
Once there, you need to modify the system variables in the define_var.sh executable:
<pre>
emacs define_var.sh
</pre>
In this file, define CUDA_ROOT, CULA_ROOT and the YoGA path properly. Note that for the latter, as YoGA is distributed with SuTrA, you should just point to the newly created trunk directory. On a Linux system you should normally have:
<pre>
export CUDA_ROOT=/usr/local/cuda
export CULA_ROOT=/usr/local/cula
export YOGA_DIR=/home/MyUserName/path2compass/trunk
</pre>
In this file, you also have to indicate the architecture of your GPU so that the compiler generates the appropriate code. Modify the following line:
<pre>
export GENCODE="arch=compute_12,code=sm_12"
</pre>
and change both occurrences of 12 to your architecture: for instance, a Fermi-class Tesla card has compute capability 2.0, so change 12 to 20; a Kepler GPU has compute capability 3.0 or 3.5 (K20), so change 12 to 30 (or 35).

Once this is done, you are ready to compile the whole library. First run define_var.sh to define the system variables that will be used during the compilation process:
<pre>
./define_var.sh
</pre>
then identify the absolute path to your Yorick executable using:
<pre>
which yorick
</pre>
and run the compilation script:
<pre>
./reinstall absolute_path_to_yorick
</pre>
If you did not get any error, CArMA, SuTrA and YoGA are now installed on your machine. You can check that everything is working by launching a GUI to test a simulation:
<pre>
yorick -i yoga_ao/ywidgets/widget_ao.i
</pre>
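As a recap, here is a minimal sketch of the whole sequence for a hypothetical Kepler (compute capability 3.5) machine with CUDA and CULA in their default locations. All paths are examples and must be adapted to your system; $(which yorick) simply substitutes the absolute path found with the command above:
<pre>
# variables to edit in define_var.sh (example values)
export CUDA_ROOT=/usr/local/cuda
export CULA_ROOT=/usr/local/cula
export YOGA_DIR=/home/MyUserName/path2compass/trunk
export GENCODE="arch=compute_35,code=sm_35"

# then, from compass/trunk
./define_var.sh
./reinstall $(which yorick)
</pre>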
h1. Install the platform with MAGMA

h2. Why MAGMA ?

The MAGMA project aims to develop a dense linear algebra library similar to LAPACK but for heterogeneous/hybrid architectures, starting with current "Multicore+GPU" systems. Unlike CULA, MAGMA provides a dense linear algebra library that handles double precision for free. However, MAGMA needs a LAPACK and a BLAS implementation. Currently, we support two options: ATLAS BLAS (free, easy to install) and MKL (free, requires a registration, but more powerful).

h2. Dependencies : gfortran

Use your package manager to install the dependencies:
* on scientific linux : yum install gcc-gfortran libgfortran
* on debian : apt-get install gfortran gfortran-multilib

h2. Configure MAGMA with ATLAS

h3. Dependencies : blas, lapack, atlas

Use your package manager to install the dependencies:
* on scientific linux : yum install blas-devel lapack-devel atlas-devel
* on debian : apt-get install libblas-dev liblapack-dev libatlas-base-dev libatlas-dev

The binary packages of ATLAS (and also OpenBLAS / GotoBLAS2) distributed by your Linux distribution (SL, Fedora, Debian, ...) are generic packages, which are not optimized for a specific machine. It is strongly advised to recompile ATLAS on your local machine to get the best performance.

IMPORTANT NOTE: when building ATLAS, you must ensure that CPU throttling is disabled (otherwise timing measurements are erroneous, which may lead to an unoptimized build of ATLAS); see http://math-atlas.sourceforge.net/atlas_install/node5.html

h3. extraction

MAGMA is available here : http://icl.cs.utk.edu/magma/software/index.html

Extract the tgz file and go into the new directory:

> ~$ tar xf magma-1.4.1-beta.tar.gz
> ~$ cd magma-1.4.1

h3. configuration

You have to create your own make.inc :

* example on a scientific linux : *please verify GPU_TARGET, LAPACKDIR, ATLASDIR, CUDADIR*

<pre><code class="Makefile">
#//////////////////////////////////////////////////////////////////////////////
#      -- MAGMA (version 1.4.1) --
#         Univ. of Tennessee, Knoxville
#         Univ. of California, Berkeley
#         Univ. of Colorado, Denver
#         November 2013
#//////////////////////////////////////////////////////////////////////////////

# GPU_TARGET specifies for which GPU you want to compile MAGMA:
#     "Tesla"  (NVIDIA compute capability 1.x cards)
#     "Fermi"  (NVIDIA compute capability 2.x cards)
#     "Kepler" (NVIDIA compute capability 3.x cards)
# See http://developer.nvidia.com/cuda-gpus
GPU_TARGET ?= Fermi

CC        = gcc
NVCC      = nvcc
FORT      = gfortran

ARCH      = ar
ARCHFLAGS = cr
RANLIB    = ranlib

OPTS      = -fPIC -O3 -DADD_ -fopenmp -DMAGMA_SETAFFINITY
F77OPTS   = -fPIC -O3 -DADD_
FOPTS     = -fPIC -O3 -DADD_ -x f95-cpp-input
NVOPTS    =       -O3 -DADD_ -Xcompiler "-fno-strict-aliasing -fPIC"
LDOPTS    = -fPIC -fopenmp

# Depending on how ATLAS and LAPACK were compiled, you may need one or more of:
LIB       = -llapack -lf77blas -latlas -lcblas -lcublas -lcudart -lstdc++ -lm -lgfortran

# define library directories here or in your environment
LAPACKDIR = /usr/lib64
ATLASDIR  = /usr/lib64/atlas
CUDADIR   = /usr/local/cuda

LIBDIR    = -L$(LAPACKDIR) \
            -L$(ATLASDIR) \
            -L$(CUDADIR)/lib64

INC       = -I$(CUDADIR)/include
</code></pre>
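To verify LAPACKDIR and ATLASDIR on your own machine, one quick check (a sketch, assuming ldconfig is available and the library names match your distribution's packaging) is to ask the dynamic linker where those libraries live:
<pre>
# list the LAPACK / ATLAS libraries known to the dynamic linker, with their paths
ldconfig -p | grep -E 'liblapack|libatlas|libf77blas|libcblas'
</pre>
The directories reported there are the ones to use for LAPACKDIR and ATLASDIR in make.inc.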
* example on debian : *please verify GPU_TARGET, LAPACKDIR, ATLASDIR, CUDADIR*

<pre><code class="Makefile">
#//////////////////////////////////////////////////////////////////////////////
#      -- MAGMA (version 1.4.1) --
#         Univ. of Tennessee, Knoxville
#         Univ. of California, Berkeley
#         Univ. of Colorado, Denver
#         November 2013
#//////////////////////////////////////////////////////////////////////////////

# GPU_TARGET specifies for which GPU you want to compile MAGMA:
#     "Tesla"  (NVIDIA compute capability 1.x cards)
#     "Fermi"  (NVIDIA compute capability 2.x cards)
#     "Kepler" (NVIDIA compute capability 3.x cards)
# See http://developer.nvidia.com/cuda-gpus
GPU_TARGET ?= Fermi

CC        = gcc
NVCC      = nvcc
FORT      = gfortran

ARCH      = ar
ARCHFLAGS = cr
RANLIB    = ranlib

OPTS      = -fPIC -O3 -DADD_ -fopenmp -DMAGMA_SETAFFINITY
F77OPTS   = -fPIC -O3 -DADD_
FOPTS     = -fPIC -O3 -DADD_ -x f95-cpp-input
NVOPTS    =       -O3 -DADD_ -Xcompiler "-fno-strict-aliasing -fPIC"
LDOPTS    = -fPIC -fopenmp

# Depending on how ATLAS and LAPACK were compiled, you may need one or more of:
LIB       = -llapack -lf77blas -latlas -lcblas -lcublas -lcudart -lstdc++ -lm -lgfortran

# define library directories here or in your environment
LAPACKDIR = /usr/lib
ATLASDIR  = /usr/lib
CUDADIR   = /usr/local/cuda

LIBDIR    = -L$(LAPACKDIR) \
            -L$(ATLASDIR) \
            -L$(CUDADIR)/lib64 \
            -L/usr/lib/x86_64-linux-gnu

INC       = -I$(CUDADIR)/include
</code></pre>

h2. Configure MAGMA with MKL

h3. extraction

To download MKL, you have to create an account here : https://registrationcenter.intel.com/RegCenter/NComForm.aspx?ProductID=1517

Extract l_ccompxe_2013_sp1.1.106.tgz and go into l_ccompxe_2013_sp1.1.106. Install it with ./install_GUI.sh and add the IPP components to the default choices.

h3. configuration

* example on debian : *please verify GPU_TARGET, MKLROOT, CUDADIR*

<pre><code class="Makefile">
#//////////////////////////////////////////////////////////////////////////////
#      -- MAGMA (version 1.4.1-beta2) --
#         Univ. of Tennessee, Knoxville
#         Univ. of California, Berkeley
#         Univ. of Colorado, Denver
#         December 2013
#//////////////////////////////////////////////////////////////////////////////

# GPU_TARGET contains one or more of Tesla, Fermi, or Kepler,
# to specify for which GPUs you want to compile MAGMA:
#     Tesla  - NVIDIA compute capability 1.x cards
#     Fermi  - NVIDIA compute capability 2.x cards
#     Kepler - NVIDIA compute capability 3.x cards
# The default is all, "Tesla Fermi Kepler".
# See http://developer.nvidia.com/cuda-gpus
#
GPU_TARGET ?= Fermi

CC        = gcc
NVCC      = nvcc
FORT      = gfortran

ARCH      = ar
ARCHFLAGS = cr
RANLIB    = ranlib

OPTS      = -fPIC -O3 -DADD_ -Wall -fno-strict-aliasing -fopenmp -DMAGMA_WITH_MKL -DMAGMA_SETAFFINITY
F77OPTS   = -fPIC -O3 -DADD_ -Wall
FOPTS     = -fPIC -O3 -DADD_ -Wall -x f95-cpp-input
NVOPTS    =       -O3 -DADD_ -Xcompiler "-fno-strict-aliasing -fPIC"
LDOPTS    = -fPIC -fopenmp

# gcc with MKL 10.3, Intel threads
LIB       = -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -lpthread -lcublas -lcudart -lstdc++ -lm -liomp5 -lgfortran

# define library directories preferably in your environment, or here.
# for MKL run, e.g.: source /opt/intel/composerxe/mkl/bin/mklvars.sh intel64
MKLROOT ?= /opt/intel/composerxe/mkl
CUDADIR ?= /usr/local/cuda
-include make.check-mkl
-include make.check-cuda

LIBDIR    = -L$(MKLROOT)/lib/intel64 \
            -L$(CUDADIR)/lib64

INC       = -I$(CUDADIR)/include -I$(MKLROOT)/include
</code></pre>

In this example, gcc is used, but with MKL you can use icc instead of gcc. In that case, you also have to compile Yorick with icc; to do so, change the CC flag in Make.cfg.
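Note that the MKL environment must be set up in the shell used for compilation. As the comment in the make.inc above suggests, this can be done with the following (the path assumes the default MKL install location used in this example):
<pre>
source /opt/intel/composerxe/mkl/bin/mklvars.sh intel64
echo $MKLROOT   # should now point to your MKL installation
</pre>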
h2. compilation and installation

h3. compilation

Just compile the shared target (and test it if you want):

> ~$ make -j 8 shared

h3. installation

To install libraries and include files in a given prefix, run:

> ~$ make install prefix=/usr/local/magma

The default prefix is /usr/local/magma. You can also set prefix in make.inc.

h3. tune (not tested)

For multi-GPU functions, set $MAGMA_NUM_GPUS to the number of GPUs to use. For multi-core BLAS libraries, set $OMP_NUM_THREADS, $MKL_NUM_THREADS or $VECLIB_MAXIMUM_THREADS to the number of CPU threads, depending on your BLAS library.

h2. Platform installation

Just define $MAGMA_PATH and use the standard installation procedure described above.
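For example, a minimal sketch, assuming MAGMA was installed with the default prefix above and that exporting the variable in your shell before running the standard procedure is sufficient (adapt the path to your actual install prefix):
<pre>
export MAGMA_PATH=/usr/local/magma
# then proceed with the standard procedure from compass/trunk
./define_var.sh
./reinstall $(which yorick)
</pre>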