How can I extract individual class probabilities from the probability VectorUDT column when using a Spark Random Forest model to predict customer churn?

Ragita · December 18, 2025, 12:27pm

To do this, use a “Split Probability” node. Place it after the “Spark Predict” node so that it receives the prediction DataFrame, including the probability column, as input. The node splits the VectorUDT probability column into two separate columns. The first column, “prob0”, holds the probability that a customer does not churn (class 0), and the second column, “prob1”, holds the probability that a customer churns (class 1). With the probabilities in separate numeric columns, you can filter, sort, or apply your own decision threshold in later steps.
