Is there a way to speed this up? The script itself is very fast, but calling the stored procedure takes over 60 seconds. For example, I ran the CALL statement at 3:24:38 PM, and the first log statement was not printed until 3:25:53 PM. What is happening in those 75 seconds between issuing the CALL statement and the PySpark script actually executing?
Or is it possible that the 75 seconds is the execution of the script itself, and the log statements (print statements within the PySpark script) are all dumped at the end? The first and last log statements are stamped within the same second, even though they do not arrive in the same second when I watch the log stream.
Creation time: Jun 13, 2024, 3:24:38 PM UTC-4
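
One way I could tell these two cases apart is to have the script stamp each message with its own clock instead of relying on the timestamps the log viewer attaches. Below is a minimal sketch of that idea. The `log` helper and `_START` are names I made up for illustration, not part of any library, and I am assuming the script's stdout is what ends up in the log stream.

```python
import sys
import time
from datetime import datetime, timezone

# Hypothetical helper: records when the script itself started running.
_START = time.monotonic()

def log(msg, stream=sys.stdout):
    """Print msg prefixed with the script's own UTC time and elapsed seconds."""
    now = datetime.now(timezone.utc).strftime("%Y-%m-%d %H:%M:%S.%f")[:-3]
    elapsed = time.monotonic() - _START
    line = f"[{now} UTC +{elapsed:.3f}s] {msg}"
    # flush=True so the line is not held in Python's own output buffer.
    print(line, file=stream, flush=True)
    return line

if __name__ == "__main__":
    log("script started")
    # ... existing PySpark logic would go here ...
    log("script finished")
```

If the embedded times on the first and last messages are about 75 seconds apart, the script really was running the whole time and the output is only being delivered in a batch. If they are within a second of each other and both fall near 3:25:53 PM, the script is fast and the delay happens before it starts.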