Installation Pre-requisites
Below are the Pre-requisites before installing Sparkflows:
- JDK 1.8+ installed on the machine;
- java and jar have to be in the PATH.
- Download Winutils and Set Path
- If running on an Apache Spark cluster, Apache Spark 2.3+ is needed on the cluster.
- 8GB+ of RAM must be available on the machine.
- Python 3.9+ ( when running Python and Pyspark, otherwise not needed )
With which user should Sparkflows be installed
If Sparkflows needs to be connected with an Apache Spark cluster the below is needed:
- Sparkflows needs to be installed as a user which can impersonate other users. Impersonation for this user has to be set up in HDFS configs.
- If you disable impersonation in Sparkflows, then the user with which Fire is installed needs to be able to submit jobs to the cluster.
More Details are available here : https://www.sparkflows.io/connecting-sparkflows-with-spark-cl
I do not see anything in my browser after I start Sparkflows
Do check in the logs for exceptions and the root cause. The log files are in fireserver.log. Possible causes are:
- The H2 database was not created and it is failing to the find the table.
- The server did not start properly because some other Application is running on the configured port. The default configured port for Sparkflows is : 8080
The http and https ports for Sparkflows can be updated in conf/application.properties.
Fire UI does not get displayed when I go to :8080. Some other UI is displayed.
Sparkflows by default runs on port 8080. It is possible that you have some other application running on port 8080, and you are seeing its output. In this case, the solution is to run the Sparkflows server on some other port which is not being used by any other application. Details for running Sparkflows on another port is here : Running on Custom Port — Sparkflows 3.0 documentation
On certain pages I get the error : Could not connect to fire at local host.
The reason for it is the the Sparkflows process is not starting. Check out the log files : fireserver.log and fire.log. If you are using Ubuntu, you may have to do the following:
- sudo dpkg-reconfigure dash
- https://unix.stackexchange.com/questions/442510/how-to-use-bash-for-sh-in-ubuntu
Select No