Why doesn’t Select Records preserve the original Excel row order?

Sanskar · January 31, 2026, 6:20am

It actually does — but not the way you might expect.

Spark DataFrames have no guaranteed row order.
To make row selection deterministic, the node assigns an internal row number using:

This creates a stable processing order, not a semantic one.

Key insight:
Row position here means execution order, not “Excel row number”.

Practical tip
If row order is business-critical, add an explicit sort column node before Select Records.

Topic	Replies	Views
How Column Order Works in the Select Node FAQs	9	January 14, 2026
Why does Sparkflows add fileName and sheetName at the very end of Read Excel Advanced? FAQs	4	January 27, 2026
Select vs Dynamic Select (What’s the Difference?) FAQs user-guide	8	January 15, 2026
I want to sort incoming columns in a particular order Data Preparation	9	December 17, 2025
When “Top N” and “After Record” are both set, which one wins? General faq	12	January 31, 2026