Skip to content

cygni/gcp-data-beans

Repository files navigation

Data-Beans-Demo

Welcome to Data Beans, a fiticous company that has a fleet of coffee trucks located in cities around the world. The source code provided here is design to high how you can use GenAI along with BigQuery on Google Cloud. The code is all working and was used to construct the demo. The demo consists of a web frontend along with the backend code contained in the project. To see the frontend, currently is it not deployed as part of the demo, please contact your account team or customer engienner.

This demo is an end to end working demo with all code provided so you can use based upon the contained license agreement.

Demo Cost

  • The demo costs ~ $1 a day to leave idle.
    • You can delete the Colab Runtime if you want to leave idle to mimimize costs. Just create the runtime when you run the demo.
    • You Colab Enterprise machine storage cost can grow. If your Colab costs become elevated, delete the Runtime (https://console.cloud.google.com/vertex-ai/colab/runtimes) and re-create it based upon the Runtime Template.
  • Running the notebooks that perform the GenAI API calls (or BigQuery) cost ~ $1 or so. It depends how much you run them. Running them for a few rows of data is mimimal cost; running for thousands will cost more.
  • Running the notebooks that perform the Machine Learning training (where you attach a GPU) cost the machine cost and time run.

Data Generation

  • All data in this demo is synthetically generated by LLMs.
  • All images in this demo are generated by GenAI.

How to deploy

The are two options to deploy the demo depending on your access privilages to your cloud organization

Require Permissions to Deploy (2 Options)

  1. Elevated Privileges - Org Level

    • The following IAM roles are required to deploy the solution
      • Prerequisite: Billing Account User (to create the project with billing)
    • To deploy the code you will:
      • Run source deploy.sh
  2. Owner Project Privileges - Typically Requires Assistance from IT

    • The following items are required to deploy the solution
      • Prerequisite: You will need a project created for you (IT can do this for you)
      • Prerequisite: You will need to be an Owner (IAM role) of the project to run the below script
    • To deploy the code you will
      • Update the hard coded values in deploy-use-existing-project-non-org-admin.sh
      • Run source deploy-use-existing-project-non-org-admin.sh

Using your Local machine (Assuming Linux based)

  1. Install Git (might already be installed)
  2. Install Curl (might already be installed)
  3. Install "jq" (might already be installed) - https://jqlang.github.io/jq/download/
  4. Install Google Cloud CLI (gcloud) - https://cloud.google.com/sdk/docs/install
  5. Install Terraform - https://developer.hashicorp.com/terraform/install
  6. Login:
    gcloud auth login
    gcloud auth application-default login
    
  7. Type: git clone https://github.com/GoogleCloudPlatform/data-beans
  8. Switch the prompt to the directory: cd data-beans
  9. Run the deployment script
    • If using Elevated Privileges
      • Run source deploy.sh
    • If using Owner Project Privileges
      • Update the hard coded values in deploy-use-existing-project-non-org-admin.sh
      • Run source deploy-use-existing-project-non-org-admin.sh
  10. Authorize the login (a popup will appear)
  11. Follow the prompts: Answer “Yes” for the Terraform approval.

To deploy through a Google Cloud Compute VM

  1. Create a new Compute VM with a Public IP address or Internet access on a Private IP
    • The default VM is fine (e.g.)
      • EC2 machine is fine for size
      • OS: Debian GNU/Linux 12 (bookworm)
  2. SSH into the machine. You might need to create a firewall rule (it will prompt you with the rule if it times out)
  3. Run these commands on the machine one by one:
    sudo apt update
    sudo apt upgrade -y
    sudo apt install git
    git config --global user.name "FirstName LastName"
    git config --global user.email "your@email-address.com"
    git clone https://github.com/GoogleCloudPlatform/data-beans
    cd data-beans/
    sudo apt-get install apt-transport-https ca-certificates gnupg curl
    sudo apt-get install jq
    gcloud auth login
    gcloud auth application-default login
    sudo apt-get update && sudo apt-get install -y gnupg software-properties-common
    wget -O- https://apt.releases.hashicorp.com/gpg | gpg --dearmor | sudo tee /usr/share/keyrings/hashicorp-archive-keyring.gpg > /dev/null
    gpg --no-default-keyring --keyring /usr/share/keyrings/hashicorp-archive-keyring.gpg --fingerprint
    echo "deb [signed-by=/usr/share/keyrings/hashicorp-archive-keyring.gpg] \
    https://apt.releases.hashicorp.com $(lsb_release -cs) main" | sudo tee /etc/apt/sources.list.d/hashicorp.list
    sudo apt update
    sudo apt-get install terraform
    
    source deploy.sh 
    # Or 
    # Update the hard coded values in deploy-use-existing-project-non-org-admin.sh
    # Run source deploy-use-existing-project-non-org-admin.sh
    

Cloud Shell (NOT WORKING)

  1. Open a Google Cloud Shell: http://shell.cloud.google.com/
  2. Type: git clone https://github.com/GoogleCloudPlatform/data-beans
  3. Switch the prompt to the directory: cd data-beans
  4. Run the deployment script
    • If using Elevated Privileges
      • Run source deploy.sh
    • If using Owner Project Privileges
      • Update the hard coded values in deploy-use-existing-project-non-org-admin.sh
      • Run source deploy-use-existing-project-non-org-admin.sh
  5. Authorize the login (a popup will appear)
  6. Follow the prompts: Answer “Yes” for the Terraform approval

About

A demo using Gen AI in GCP

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages