Hi!
I was working with this notebook sample on Colab Enterprise (Vertex AI RLHF Notebook Sample on GitHub), and the Vertex AI pipeline job fails when the training step starts.
The pipeline logs show an error that looks like a mismatch between the numpy and pandas versions. Here is the error:
ValueError: numpy.dtype size changed, may indicate binary incompatibility. Expected 96 from C header, got 88 from PyObject
File "pandas/_libs/interval.pyx", line 1, in init pandas._libs.interval
from pandas._libs.interval import Interval
But I seem to have the correct versions installed: numpy 1.26.4 and pandas 2.2.2. According to the pandas documentation, these versions should be compatible: https://pandas.pydata.org/pandas-docs/stable/whatsnew/v2.2.2.html
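For anyone trying to reproduce this, here is a minimal sketch of how the installed versions can be checked. It is only illustrative: `report_versions` is a hypothetical helper name, not something from the notebook. It reports what is installed in the interpreter where it runs, which is the notebook runtime if run in a cell. The container that executes the pipeline's training step may have different versions, so the check would need to run there to be conclusive.

```python
from importlib import metadata


def report_versions(packages=("numpy", "pandas")):
    """Return {package name: installed version, or None if not installed}.

    Hypothetical helper for illustration; it reads package metadata only,
    so it does not import numpy or pandas and cannot trigger the error.
    """
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    # Print one line per package for easy copy-paste into a bug report.
    for name, version in report_versions().items():
        print(f"{name}: {version or 'not installed'}")
```

Reading the metadata instead of importing the packages keeps the check usable even in an environment where `import pandas` itself fails.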
What you expected to happen:
The pipeline job to complete
Steps to reproduce:
- Open the notebook (Vertex AI RLHF Notebook Sample on GitHub)
- Fill in the input fields
- Run the cells up to 'Run the pipeline using Vertex AI Pipelines'
- Wait while the pipeline runs (6-8 minutes)
- Check the Vertex AI UI and logs for the error
I have checked other forum threads about this issue (pandas failure in flex app engine deployment), but in all of them the issue was caused by incompatible versions.
Update: The issue happens both when running the notebook from GitHub unmodified and when running it modified (with a different dataset and parameters).
Any help would be appreciated!
Hi gabrielmc,
Welcome to the Google Cloud Community!
The error "ValueError: numpy.dtype size changed, may indicate binary incompatibility" is typically caused by mismatched binary builds of these packages. Here are some potential solutions to resolve the issue:
You may also check this documentation for more information about Vertex AI Pipelines.
If the issue persists, you may reach out to Google Cloud Support. When reaching out, include detailed information and relevant screenshots of the errors you’ve encountered. This will assist them in diagnosing and resolving your issue more efficiently.
Was this helpful? If so, please accept this answer as “Solution”. If you need additional assistance, reply here within 2 business days and I’ll be happy to help.
Thanks a lot for the reply.
I tried that and it didn't work, unfortunately.
I decided to run the sample notebook exactly as it is in Vertex AI's GitHub repository (with the provided sample datasets) in a new runtime instance, and I got the same error. So the issue is not dataset-related, and it's likely not something that I did wrong.
It would be helpful to know if other developers are having the same issue running this notebook.