
Selecting multiple files with a wildcard

I am struggling with this and no one from Google support seems to know how it works.

I would like to select all files whose names contain certain keywords in Data Fusion, wrangle them to get the desired output, and load the result into BigQuery. I have tried these wildcard recipes without success: https://cloud.google.com/storage/docs/gsutil/addlhelp/WildcardNames, https://cloud.google.com/bigquery/docs/external-data-cloud-storage?_ga=2.29954751.-1697255675.165329... and the path gs://decompressautomated/dcm_account381803_click_*\.csv.

Please assist. An example of the files to be selected is attached (Screenshot 2022-07-11 at 23.02.08.png).

 

@kolban @Eduardo_Ortiz 


Just to play it back ... you have a GCS bucket that contains files.  You want to use GCP Data Fusion to read the files and load them into BigQuery.  However, you don't want to select ALL the files ... but instead only want to select a subset of that based on the file names?   What have you tried so far?

Here are the logs from my run.



Looking here ... https://cloud.google.com/storage/docs/gsutil/addlhelp/WildcardNames

I see in the examples on that page that one doesn't appear to need to escape the "." (period).

In your example you are using:

gs://decompressautomated/dcm_account381803_click_*\.csv

Let us try:

gs://decompressautomated/dcm_account381803_click_*.csv

That page describes the use with the "gsutil" command, but it may be that the same applies to Data Fusion.
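To make the difference between the two patterns concrete, here is a small sketch using only the Python standard library. It is an approximation, not a description of how Data Fusion or gsutil evaluates paths: `fnmatch` stands in for shell-style wildcards, `re` shows what happens if the same string is read as a regular expression, and the object names and helper functions are made up for illustration.

```python
import fnmatch
import re

# Hypothetical object names, invented for illustration only.
names = [
    "dcm_account381803_click_20220710.csv",
    "dcm_account381803_click_20220711.csv",
    "dcm_account381803_impression_20220711.csv",
    "dcm_account381803_click_20220711.json",
]

escaped_glob = r"dcm_account381803_click_*\.csv"  # pattern from the question
plain_glob = "dcm_account381803_click_*.csv"      # pattern suggested above
regex = r"dcm_account381803_click_.*\.csv"        # same intent written as a regex


def glob_matches(pattern, candidates):
    """Return the names matching a shell-style wildcard (fnmatch semantics)."""
    return [n for n in candidates if fnmatch.fnmatchcase(n, pattern)]


def regex_matches(pattern, candidates):
    """Return the names that a regular expression matches in full."""
    return [n for n in candidates if re.fullmatch(pattern, n)]


# As a wildcard, "\." asks for a literal backslash, so nothing matches.
print(glob_matches(escaped_glob, names))
# Without the backslash, the two "click" CSV files match.
print(glob_matches(plain_glob, names))
# As a regex, "_*" means "zero or more underscores", so nothing matches either.
print(regex_matches(escaped_glob, names))
# ".*" is the regex spelling of "any characters".
print(regex_matches(regex, names))
```

The pattern from the question mixes the two syntaxes, which is why it fails under both readings here. Which syntax a given Data Fusion field expects is not confirmed in this thread, so it is worth checking the plugin documentation before picking one form.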

That does not work. Is there any chance we could have a video call, please?