RubyPDF Blog Data Mining Install Pandas on Dreamhost

Install Pandas on Dreamhost

pandas is an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language.

Dreamhost has its own system installation of Python, but we can install an self-contained version of Python and its packages, and here is the method to install Pandas,

  • Install on a User Account
export PY_SUITE=~/opt/py/suite
  • Download Python3.4.3 source code
cd ~/opt/py/suite/src/
wget http://www.python.org/ftp/python/3.4.3/Python-3.4.3.tgz
  • Compile and install Python
cd ~/opt/py/suite/build/
tar -xzvf ../src/Python-3.4.3.tgz
cd Python-3.4.3/
./configure  --prefix=$PY_SUITE
make -j4
make install
  • Add env settings to .bash_profile
#Shell variable used during installation
export PY_SUITE=~/opt/py/suite
export PATH=$PY_SUITE/bin:$PATH
export LD_LIBRARY_PATH=$PY_SUITE/lib:$LD_LIBRARY_PATH  
  • Install Pandas Library
easy_install-3.4 pandas
  •  Try Pandas on Dreamhost
  • [glenlossie]$ python3
    Python 3.4.3 (default, Apr 14 2015, 15:42:34)
    [GCC 4.6.3] on linux
    Type "help", "copyright", "credits" or "license" for more information.
    >>> import pandas as pd
    >>> df = pd.read_csv('top1m.csv')
    >>> df.columns
    Index(['index', 'domain'], dtype='object')
    >>> df
     index domain
    0 1 google.com
    1 2 facebook.com
    2 3 youtube.com
    3 4 yahoo.com
    
    ... ... ...
    
    999998 999999 profacts.ru
    999999 1000000 onad.com
    [1000000 rows x 2 columns]
    >>>

pandas00pandas01

3 thoughts on “Install Pandas on Dreamhost”

  1. Sometimes, Dreamhost will kill the process because or high cpu or memory usage,

    Yikes! One of your processes (python3, pid 26350) was just killed for excessive resource usage.
    Please contact DreamHost Support for details.

Leave a Reply

This site uses Akismet to reduce spam. Learn how your comment data is processed.