fix minor issues

fedorov · fedorov · commit 6da1fced4791 · 2023-05-05T11:45:45.000-04:00
diff --git a/notebooks/getting_started/part2_searching_basics.ipynb b/notebooks/getting_started/part2_searching_basics.ipynb
@@ -65,7 +65,8 @@
       "cell_type": "code",
       "execution_count": null,
       "metadata": {
-        "id": "bDGChJBK9ooq"
+        "id": "bDGChJBK9ooq",
+        "cellView": "form"
       },
       "outputs": [],
       "source": [
@@ -139,7 +140,7 @@
         "  bigquery-public-data.idc_current.dicom_all\n",
         "```\n",
         "\n",
-        "To run this query interactively, copy the query above to the clipboard, paste it into the Editor tab in the [BigQuery SQL workspace](https://console.cloud.google.com/bigquery), and hit the \"Run\" button. Within few moments you should be able to see the list of collections in IDC in the \"Query results\" section of the interface.\n",
+        "To run this query interactively, copy the query above to the clipboard, paste it into the query tab in the [BigQuery SQL workspace](https://console.cloud.google.com/bigquery), and hit the \"Run\" button. Within few moments you should be able to see the list of collections in IDC in the \"Query results\" section of the interface.\n",
         "\n",
         "![bq_run](https://www.dropbox.com/s/6ah98n6e9ik18if/bq_run.png?raw=1)\n",
         "\n",
@@ -167,12 +168,12 @@
         "2. `idc_current`is a _dataset_ within the `bigquery-public-data` project. Think of BigQuery datasets as containers that are used to organize and control access to the tables within the project.\n",
         "3. `dicom_all` is one of the tables within the `idc_current` dataset. As you spend more time learning about IDC, you will hopefully leverage other tables available in that dataset.\n",
         "\n",
-        "If you now look back at the [BigQuery console](https://console.cloud.google.com/bigquery) and expand the list of datasets under the `bigquery-public-data` project, you will see that in addition to the `idc_current` dataset there are also datasets `idc_v12`, `idc_v11`, etc all the way to `idc_v1`. Those datasets correspond to the IDC data release versions, with `idc_current` being an alias for the latest (at the moment, v12) version of IDC data. \n",
+        "If you now look back at the [BigQuery console](https://console.cloud.google.com/bigquery) and expand the list of datasets under the `bigquery-public-data` project, you will see that in addition to the `idc_current` dataset there are also datasets `idc_v14`, `idc_v13`, etc all the way to `idc_v1`. Those datasets correspond to the IDC data release versions, with `idc_current` being an alias for the latest (at the moment of writing this, v14 is the latest release) version of IDC data. \n",
         "\n",
         "We will not spend time discussing how IDC versioning works, but it is important to know that \n",
         "\n",
         "1. IDC data is versioned;\n",
-        "2. queries against the `idc_current` dataset are equivalent to the queries against the latest version (currently, `idc_v12`) of IDC data;\n",
+        "2. queries against the `idc_current` dataset are equivalent to the queries against the latest version (currently, `idc_v14`) of IDC data;\n",
         "3. if you want the results of the queries to be persistent, write those against `idc_v*` datasets instead of `idc_current`."
       ]
     },
@@ -370,6 +371,8 @@
         "  # Use AND operator to combine the filter values for the\n",
         "  # Modality and tcia_tumorLocation to select collections that\n",
         "  # include MR images for Lung cancer locations\n",
+        "  # Note that SQL uses single = for comparison, and strings should\n",
+        "  # be enclosed in \"\"\n",
         "\"\"\"\n",
         "\n",
         "selection_result = bq_client.query(selection_query)\n",
@@ -415,7 +418,6 @@
         "# we specified in the beginning of the notebook!\n",
         "bq_client = bigquery.Client(my_ProjectID)\n",
         "\n",
-        "# Execution of this cell will fail unless you wrote the query below!\n",
         "selection_query = \"\"\"\n",
         "SELECT \n",
         "  COUNT(DISTINCT(PatientID)) as patient_cnt\n",