Cloudera, one of the leading Hadoop distributions, provides a ready-made virtual machine for getting started quickly on its platform. In this article, we take a look at the installation of the Cloudera QuickStart VM.

Databricks Community Edition is an excellent environment for practicing PySpark-related assignments. However, if you are not satisfied with its speed or the default cluster and need to practice Hadoop commands, you can set up your own PySpark Jupyter Notebook environment within the Cloudera QuickStart VM as outlined below. I use a Mac for my work, but Windows is an equally viable option.

Step 2: Extract the downloaded “cloudera-quickstart-vm-5.12.0-0-vmware.zip”.

Step 3: Open the installed VMware Workstation Player (for non-commercial use) to learn Cloudera.

Step 4: Open a virtual machine by selecting the previously downloaded and extracted file “cloudera-quickstart-vm-5.12.0-0-vmware.vmx”.
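
If you prefer to script the extraction step rather than use a file manager, the following is a minimal sketch in Python using only the standard library. The helper name `extract_quickstart_vm` and the `~/Downloads` location are assumptions for illustration, not part of Cloudera's instructions. It unpacks the archive and lists any `.vmx` files found, which is the file you then select in VMware.

```python
import zipfile
from pathlib import Path


def extract_quickstart_vm(zip_path, dest_dir):
    """Extract the archive into dest_dir and return the .vmx files it contains.

    Hypothetical helper for illustration; adjust paths to your own setup.
    """
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
    # The .vmx file is what you open from VMware in the next steps.
    return sorted(dest.rglob("*.vmx"))


if __name__ == "__main__":
    # Assumed download location; change this to wherever you saved the zip.
    archive = Path.home() / "Downloads" / "cloudera-quickstart-vm-5.12.0-0-vmware.zip"
    if archive.exists():
        # Extract next to the archive, into a folder named after it.
        for vmx in extract_quickstart_vm(archive, archive.with_suffix("")):
            print(vmx)
    else:
        print(f"Archive not found: {archive}")
```

The printed path is the one to pick in the "Open a virtual machine" dialog.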