pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.
Dreamhost has its own system installation of Python, but we can install an self-contained version of Python and its packages, and here is the method to install Pandas,
- Install on a User Account
export PY_SUITE=~/opt/py/suite
- Download Python3.4.3 source code
cd ~/opt/py/suite/src/
wget http://www.python.org/ftp/python/3.4.3/Python-3.4.3.tgz
- Compile and install Python
cd ~/opt/py/suite/build/
tar -xzvf ../src/Python-3.4.3.tgz
cd Python-3.4.3/
./configure --prefix=$PY_SUITE
make -j4
make install
- Add env settings to .bash_profile
#Shell variable used during installation
export PY_SUITE=~/opt/py/suite
export PATH=$PY_SUITE/bin:$PATH
export LD_LIBRARY_PATH=$PY_SUITE/lib:$LD_LIBRARY_PATH
- Install Pandas Library
easy_install-3.4 pandas
- Â Try Pandas on Dreamhost
-
[glenlossie]$ python3 Python 3.4.3 (default, Apr 14 2015, 15:42:34) [GCC 4.6.3] on linux Type "help", "copyright", "credits" or "license" for more information. >>> import pandas as pd >>> df = pd.read_csv('top1m.csv') >>> df.columns Index(['index', 'domain'], dtype='object') >>> df index domain 0 1 google.com 1 2 facebook.com 2 3 youtube.com 3 4 yahoo.com ... ... ... 999998 999999 profacts.ru 999999 1000000 onad.com [1000000 rows x 2 columns] >>>
Sometimes, Dreamhost will kill the process because or high cpu or memory usage,
How can I get around the “excessive resource usage” error?
even the simplest example? or you really use many resources? in fact dreamhost does not suit data analysis.