
Error in processing data with Google Dataflow

In the first step of the "Processing data with Google Dataflow" lab, I get the following message when running the python ./df05.py command:

WARNING:root:Make sure that locally built Python SDK docker image has Python 3.7 interpreter.

ACCEPTED SOLUTION

Dear Gouri,

First, please try to run the command with python3, like python3 ./df05.py.

If that does not work, make sure you are running df05.py in an environment with Python 3.7 installed. You can check this with the python3 --version command. If it reports Python 3.8.X, then df05.py won't start.
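The same check can also be done from inside Python. This is only a minimal sketch: it assumes the lab expects Python 3.7, as the warning suggests, and the helper name matches_required is made up for illustration.

```python
import sys

# Assumption taken from the warning text: the lab expects Python 3.7.
REQUIRED = (3, 7)


def matches_required(version_info=sys.version_info, required=REQUIRED):
    """Return True if the major.minor part of version_info equals required."""
    return tuple(version_info[:2]) == tuple(required)


if __name__ == "__main__":
    found = "%d.%d" % tuple(sys.version_info[:2])
    if matches_required():
        print("Python %s matches the expected version." % found)
    else:
        print("Python %s found, but %d.%d is expected." % ((found,) + REQUIRED))
```

Running it once with python and once with python3 shows which of the two commands points at a 3.7 interpreter.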

I think I had a similar problem after following these steps before starting df05.py (all taken from the lab instructions):

git clone https://github.com/GoogleCloudPlatform/data-science-on-gcp/

cd ~/data-science-on-gcp/04_streaming/simulate

virtualenv data-sci-env -p python3

source data-sci-env/bin/activate

pip install timezonefinder pytz
pip install apache-beam[gcp]

After that, my Python version was 3.8.X for some reason, and I could not run the command python ./df05.py.
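If it is unclear which interpreter an activated environment actually uses, a small script can report it. This is a sketch only, and the function names in_virtualenv and describe_interpreter are hypothetical helpers, not part of the lab.

```python
import sys


def in_virtualenv():
    """Return True when the running interpreter belongs to a virtual environment."""
    # Older virtualenv releases set sys.real_prefix; venv and newer
    # virtualenv releases make sys.prefix differ from sys.base_prefix.
    if hasattr(sys, "real_prefix"):
        return True
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


def describe_interpreter():
    """Return a one-line summary of the running interpreter."""
    version = ".".join(str(part) for part in sys.version_info[:3])
    return "%s (Python %s, virtualenv: %s)" % (
        sys.executable,
        version,
        in_virtualenv(),
    )


if __name__ == "__main__":
    print(describe_interpreter())
```

Running it before and after source data-sci-env/bin/activate should show whether activating the environment changes the Python version.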

What I did then was deactivate the virtual environment, install the necessary Python libraries again, and run df05.py again, as shown below:

deactivate

pip install timezonefinder pytz
pip install apache-beam[gcp]

python ./df05.py (I cannot remember whether I used python or python3 here, so please try both)

This was the solution in my case. Hopefully it will help you too.
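On the question of python versus python3, the sketch below lists what each command resolves to on the current PATH. It is an illustration only: interpreter_versions is a hypothetical helper, and it relies on the standard library alone.

```python
import shutil
import subprocess


def interpreter_versions(names=("python", "python3")):
    """Map each command name to its reported version, or None if not on PATH."""
    versions = {}
    for name in names:
        path = shutil.which(name)
        if path is None:
            versions[name] = None
            continue
        # Python 2 prints its version to stderr, so both streams are merged.
        result = subprocess.run(
            [path, "--version"],
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,
            universal_newlines=True,
        )
        versions[name] = result.stdout.strip()
    return versions


if __name__ == "__main__":
    for name, version in interpreter_versions().items():
        print("%s -> %s" % (name, version or "not found"))
```

Whichever command reports a 3.7 version is the one to use for ./df05.py.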

Best,

MC

 
