Update upload_files_to_colab example to use genai module (#486)

Faakhir30 · Giom-V · web-flow · commit fc8517989e87 · 2025-03-04T17:52:02.000+01:00
Refers #446 * Updated `examples/Upload_files_to_Colab.ipynb` to use `google-genai` module instead of `generativeai`. * Removing link to the colab I'm removing the link to this colab in the README as I don't think it's worth having it there. While it's good to have this code example, it doesn't really bring any value as an example of the advanced capabilities of Gemini. --------- Signed-off-by: Faakhir30 <zahidfaakhir@gmail.com> Co-authored-by: Guillaume Vernade <Giom-V@users.noreply.github.com>
diff --git a/examples/README.md b/examples/README.md
@@ -40,7 +40,6 @@ Some old examples are still using the legacy SDK, they should still work and are
 * [Story writing with prompt chaining.ipynb](./Story_Writing_with_Prompt_Chaining.ipynb): Write a story using two powerful tools: prompt chaining and iterative generation.
 * [Tag and Caption images](./Tag_and_caption_images.ipynb): Use the Gemini model's vision capabilities and the embedding model to add tags and captions to images of pieces of clothing.
 * [Talk to documents](./Talk_to_documents_with_embeddings.ipynb): This is a basic intro to Retrieval Augmented Generation (RAG). Use embeddings to search through a custom database.
-* [Upload files to Colab](./Upload_files_to_Colab.ipynb): This is a helper notebook that shows how to upload files from your local computer to Colab. Note: to upload files to the Gemini API (text, code, images, audio, video), check out the [Files quickstart](../quickstarts/File_API.ipynb).
 * [Voice Memos](./Voice_memos.ipynb): You'll use the Gemini API to help you generate ideas for your next blog post, based on voice memos you recorded on your phone, and previous articles you've written.
 * [Translate a public domain](./Translate_a_Public_Domain_Book.ipynb): In this notebook, you will explore Gemini model as a translation tool, demonstrating how to prepare data, create effective prompts, and save results into a `.txt` file.
 * [Working with Charts, Graphs, and Slide Decks](./Working_with_Charts_Graphs_and_Slide_Decks.ipynb): Gemini models are powerful multimodal LLMs that can process both text and image inputs. This notebook shows how Gemini Flash model is capable of extracting data from various images.
diff --git a/examples/Upload_files_to_Colab.ipynb b/examples/Upload_files_to_Colab.ipynb
@@ -69,7 +69,7 @@
     },
     {
       "cell_type": "code",
-      "execution_count": 1,
+      "execution_count": 2,
       "metadata": {
         "id": "a90kMgiDvNlk"
       },
@@ -111,24 +111,22 @@
     },
     {
       "cell_type": "code",
-      "execution_count": null,
+      "execution_count": 3,
       "metadata": {
         "id": "_d_yY8XWGQ12"
       },
-      "outputs": [],
-      "source": [
-        "!pip install -U -q \"google-generativeai>=0.7.2\""
-      ]
-    },
-    {
-      "cell_type": "code",
-      "execution_count": null,
-      "metadata": {
-        "id": "TeVyF3GtGQ13"
-      },
-      "outputs": [],
+      "outputs": [
+        {
+          "name": "stdout",
+          "output_type": "stream",
+          "text": [
+            "\u001b[?25l   \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m0.0/137.7 kB\u001b[0m \u001b[31m?\u001b[0m eta \u001b[36m-:--:--\u001b[0m\r\u001b[2K   \u001b[91m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m\u001b[91m╸\u001b[0m\u001b[90m━\u001b[0m \u001b[32m133.1/137.7 kB\u001b[0m \u001b[31m3.9 MB/s\u001b[0m eta \u001b[36m0:00:01\u001b[0m\r\u001b[2K   \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m137.7/137.7 kB\u001b[0m \u001b[31m3.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
+            "\u001b[?25h"
+          ]
+        }
+      ],
       "source": [
-        "import google.generativeai as genai"
+        "%pip install -U -q \"google-genai>=1.0.0\""
       ]
     },
     {
@@ -142,21 +140,22 @@
     },
     {
       "cell_type": "code",
-      "execution_count": null,
+      "execution_count": 4,
       "metadata": {
         "id": "iWd---jVKV5M"
       },
       "outputs": [],
       "source": [
         "from google.colab import userdata\n",
+        "from google import genai\n",
         "\n",
         "GOOGLE_API_KEY=userdata.get('GOOGLE_API_KEY')\n",
-        "genai.configure(api_key=GOOGLE_API_KEY)"
+        "client = genai.Client(api_key=GOOGLE_API_KEY)"
       ]
     },
     {
       "cell_type": "code",
-      "execution_count": 3,
+      "execution_count": 5,
       "metadata": {
         "id": "11dbc57d09f2"
       },
@@ -165,17 +164,28 @@
           "name": "stdout",
           "output_type": "stream",
           "text": [
-            "This transcript is a record of the air-to-ground voice transmissions from the Apollo 11 mission. It covers the period from the launch of the Saturn V rocket up to the splashdown of the Apollo 11 spacecraft in the Pacific Ocean. The transcript is divided into 72 tapes. It includes communication between the astronauts (Commander Neil Armstrong, Command Module Pilot Michael Collins, and Lunar Module Pilot Edwin \"Buzz\" Aldrin) and ground control, including Mission Control Center (MCC) and other sites, and recordings from the astronauts' onboard systems.\n",
+            "Based on the provided text, this is a transcript of air-to-ground voice communications from the Apollo 11 mission. It includes:\n",
+            "\n",
+            "*   **Introduction:** Explains the document is a transcription of GOSS NET 1 (Technical Air-to-Ground Voice Transmission) for Apollo 11. It lists abbreviations used for different speakers (Commander, Command Module Pilot, Lunar Module Pilot, various ground control and recovery personnel).\n",
             "\n",
-            "The transcript is a valuable resource for understanding the events of the Apollo 11 mission, and it provides insights into the personalities and working methods of the astronauts and ground control personnel. The transcript also includes a great deal of technical information about the spacecraft and the mission, such as the detailed procedures that were followed, and the readings from the spacecraft's instruments.\n",
+            "*   **Abbreviations Key:** Provides a key to understand who the different communicators are (e.g., CDR = Commander, CMP = Command Module Pilot, CC = Capsule Communicator).\n",
+            "\n",
+            "*   **Air-to-Ground Voice Transcription:** The main body of the document, which is the transcribed dialogue between the Apollo 11 astronauts and mission control in Houston, as well as other ground stations. The text is segmented by timecode and location (e.g., MILA, GRAND BAHAMA ISLANDS, CANARY). It covers various aspects of the mission from launch to initial post-splashdown communications.\n",
+            "\n",
+            "In short, this is the official record of what was said between the Apollo 11 crew and ground control during the mission, providing a detailed account of procedures, observations, and conversations.\n",
             "\n"
           ]
         }
       ],
       "source": [
-        "model = genai.GenerativeModel('gemini-2.0-flash')\n",
-        "\n",
-        "response = model.generate_content(['What is this transcript?', text_data])\n",
+        "MODEL_ID = \"gemini-2.0-flash\" # @param [\"gemini-2.0-flash-lite\",\"gemini-2.0-flash\",\"gemini-2.0-pro-exp-02-05\"] {\"allow-input\":true, isTemplate: true}\n",
+        "response = client.models.generate_content(\n",
+        "    model=MODEL_ID,\n",
+        "    contents=[\n",
+        "        'What is this transcript?',\n",
+        "        text_data\n",
+        "    ]\n",
+        ")\n",
         "print(response.text)"
       ]
     }