Ask Me Anything: Simplify Day-2 Operations with the Google Cloud Architecture Framework

Published on ‎09-27-2022 07:55 AM by Community Manager | Updated on ‎09-27-2022 07:56 AM

Congratulations - you've deployed your app, set up your Kubernetes cluster, or launched a new service. But now come the challenges of day-2 operations, managing: configuration drift, service reliability, scaling up, and more.

To help teams simplify day-2 operations, we're hosting a live Ask Me Anything session on November 10th with Google Cloud operations experts, Rakesh Dhoopar, Omkar Suram, and others. We'll quickly recap the NEXT '22 session on "Reducing Day-2 Operational Complexity" and cover additional topics in more depth.

Join us to learn the important activities and tools needed to efficiently:

  • Operationalize your workload management
  • Optimize day-2 operations using Google managed services

As always, you'll have the opportunity to ask the experts questions and receive answers live.

Save your spot by selecting the "Yes" button on the right, and ask your questions in advance by posting a comment below. 

We hope to see you there!

Meeting information



Featured Guests


Event has ended
You can no longer attend this event.

Start:
Thu, Nov 10, 2022 10:00 AM PST
End:
Thu, Nov 10, 2022 11:00 AM PST
13 Comments
mrids
Bronze 1
Bronze 1

We have these challenges in Day-2:

  1. Tracing attack signatures on Cloud Armor and defining right set of rules hierarchy like blocking country list of banned countries. 
  2. When we migrate legacy applications, clients are also legacy and are not compliant to all OWASP rules. We need to permit certain runtime exceptions and block SQL injection kind of rules. These are not always known upfront and are detected Day 2 onwards. What are the ways to quickly mitigate them
  3. We use hybrid setup with VPN to our on-prem datacenter and firewalls in between. Detecting network issues and isolating them to VPN vs firewall become challenging
  4. Root Reconciler goes into loop. how does one configure right resources to avoid this. how does one estimate the same ahead of time
  5. Control Plane upgrades and auto-upgrade caused pod scaling issues. How does one prepare for that without provisioning extra resources. Nodes need to be flushed and drained to recover from this
  6. Pod's running out of space making it hard to determine what portion is being used by RAM and storage. Hard to figure out what portion is being used within ephemeral storage and causing memory flare ups
  7. How to detect any rogue IP has not intruded inside bypassing Cloud Armor rules. How to detect rogue elements are not connecting thru VPN or proxies.  We have examples where same source IP was allowed and also blocked. What detection /tracking/alerts do we need to build to identify those.
Lauren_vdv
Community Manager
Community Manager

Hey @mrids thanks for your questions! We did our best to address these questions during the live session. Please see the written responses in our event recap post here

Thank you! 

Thanks for the same as I did watch the recording later and saw my queries being addressed but some of them do need some detailed discussions for which I would reach out separately

UBITian
Bronze 5
Bronze 5

Very Logical Question.

SujanaRaja
Bronze 1
Bronze 1

Questions on Google Cloud Search that we are currently exploring for our Enterprise Search needs:

  1. Based on our understanding GCS doesn’t offer any configuration to adjust the importance of google sources like google drive etc, if this is true, is this likely to change in future.
  2. Can we override crowding configuration based on query context, for example if we believe the user is searching for a blog, and the blog source has crowding set to 4, can we override this during query time.
  3. Can people search be based on a different intranet application rather than google contacts? 
  4. Is there a feature to respond to queries with a contact as a result.  For ex: “Who” or “Whom” questions     a.Who is the CIO?     b.Who is an expert in AI/ML?     c.Whom to contact for payroll?
  5. Users can’t give explicit feedback for search results. Is this planned in the future?

Note: We are unable to make it to the session due to the timezone and will look forward to the recording. 

Rasvi
Bronze 3
Bronze 3

 Cloud Monitoring Chanel :  When will be able to connect internal webhook ( VPC has connection with on-premises via interconnect ) 

Lauren_vdv
Community Manager
Community Manager

@Rasvi For connecting to internal webhooks, you can route them through Pub/Sub. Uptime checks work against private endpoints as well. See Creating custom notifications with Cloud Monitoring and Cloud Run for more information.

Thanks!

I did the same instead of cloud run I have used cloud function , Thank you so much 

pfrankwicz
Bronze 4
Bronze 4

Hi from Maine; question from a Google Skill Boost associate cloud engineer student: is there documentation of yaml and runtime.go deployment files best known practices for successfully Day 2 operations [PaaS, Cloud Run, GKE, etc] Thanks, pf

Allieeee
Bronze 1
Bronze 1

Hun

Lauren_vdv
Community Manager
Community Manager

Nice Work @Lauren_vdv 

UBITian
Bronze 5
Bronze 5

@Lauren_vdv today Google's Trusted Partner Accredible award me something 🥳

View my Accomplishment over :-

https://www.linkedin.com/feed/update/activity:7005513805052932096