Commit 14ebcd7

Add API mapping table to HF Guide (#2130)
* Add API mapping table to HF Guide
* add md and ipynb files
* remove user specific details
* remove NNX from table as it is not merged
1 parent aef839c commit 14ebcd7

3 files changed: +168 -92 lines changed
guides/ipynb/keras_hub/hugging_face_keras_integration.ipynb

Lines changed: 68 additions & 40 deletions
@@ -8,10 +8,10 @@
 "source": [
 "# Loading HuggingFace Transformers checkpoints into multi-backend KerasHub models\n",
 "\n",
-"**Author:** [Laxma Reddy Patlolla](https://github.com/laxmareddyp), [Divyashree Sreepathihalli](https://github.com/divyashreepathihalli)<br><br>\n",
-"**Date created:** 2025/06/17<br><br>\n",
-"**Last modified:** 2025/06/17<br><br>\n",
-"**Description:** How to load and run inference from KerasHub model checkpoints hosted on HuggingFace Hub."
+"**Author:** [Laxma Reddy Patlolla](https://github.com/laxmareddyp), [Divyashree Sreepathihalli](https://github.com/divyashreepathihalli)<br>\n",
+"**Date created:** 2025/06/17<br>\n",
+"**Last modified:** 2025/06/23<br>\n",
+"**Description:** How to load and run inference from KerasHub model checkpoints hosted on the HuggingFace Hub."
 ]
 },
 {
@@ -50,7 +50,10 @@
 "You'll primarily need `keras` and `keras_hub`.\n",
 "\n",
 "**Note:** Changing the backend after Keras has been imported might not work as expected.\n",
-"Ensure `KERAS_BACKEND` is set at the beginning of your script."
+"Ensure `KERAS_BACKEND` is set at the beginning of your script. Similarly, when working\n",
+"outside of colab, you might use `os.environ[\"HF_TOKEN\"] = \"<YOUR_HF_TOKEN>\"` to authenticate\n",
+"to HuggingFace. Set your `HF_TOKEN` as \"Colab secret\", when working with\n",
+"Google Colab."
 ]
 },
 {
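The note added in this hunk says to set `KERAS_BACKEND` (and, outside Colab, `HF_TOKEN`) before Keras is imported, because the backend is fixed at import time. A minimal stdlib-only sketch of that setup; the token value is a placeholder, not a real credential:

```python
import os

# Both variables must be exported before `import keras`:
# the backend choice is read once, at import time.
os.environ["KERAS_BACKEND"] = "jax"          # or "tensorflow" / "torch"
os.environ["HF_TOKEN"] = "<YOUR_HF_TOKEN>"   # placeholder; on Colab, prefer a Colab secret
```

On Colab itself, the guide's text recommends storing the token as a Colab secret rather than hard-coding it in the notebook.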
@@ -75,12 +78,37 @@
 "colab_type": "text"
 },
 "source": [
+"### Changing precision\n",
+"\n",
+"To perform inference and training on affordable hardware, you can adjust your\n",
+"model’s precision by configuring it through `keras.config` as follows"
+]
+},
+{
+"cell_type": "code",
+"execution_count": 0,
+"metadata": {
+"colab_type": "code"
+},
+"outputs": [],
+"source": [
+"import keras\n",
+"\n",
+"keras.config.set_dtype_policy(\"bfloat16\")"
+]
+},
+{
+"cell_type": "markdown",
+"metadata": {
+"colab_type": "text"
+},
+"source": [
+"## Loading a HuggingFace model\n",
+"\n",
 "KerasHub allows you to easily load models from HuggingFace Transformers.\n",
 "Here's an example of how to load a Gemma causal language model.\n",
 "In this particular case, you will need to consent to Google's license on\n",
-"HuggingFace for being able to download model weights, and provide your\n",
-"`HF_TOKEN` as environment variable or as \"Colab secret\" when working with\n",
-"Google Colab."
+"HuggingFace for being able to download model weights."
 ]
 },
 {
@@ -162,8 +190,9 @@
 },
 "outputs": [],
 "source": [
+"HF_USERNAME = \"<YOUR_HF_USERNAME>\" # provide your hf username\n",
 "gemma_lm.save_to_preset(\"./gemma-2b-finetuned\")\n",
-"keras_hub.upload_preset(\"hf://laxmareddyp/gemma-2b-finetune\", \"./gemma-2b-finetuned\")"
+"keras_hub.upload_preset(f\"hf://{HF_USERNAME}/gemma-2b-finetune\", \"./gemma-2b-finetuned\")"
 ]
 },
 {
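This hunk replaces a hard-coded maintainer username in the upload URI with an f-string over `HF_USERNAME` (the commit message's "remove user specific details"). A quick sketch of just the URI construction, with the username left as a placeholder:

```python
# Parameterized upload target, as introduced by this hunk.
# The username is a placeholder; substitute your own HF account.
HF_USERNAME = "<YOUR_HF_USERNAME>"
preset_uri = f"hf://{HF_USERNAME}/gemma-2b-finetune"
print(preset_uri)  # hf://<YOUR_HF_USERNAME>/gemma-2b-finetune
```

The resulting `hf://<namespace>/<model-name>` URI is the same scheme the guide later uses with `from_preset`.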
@@ -210,8 +239,8 @@
 "## Run transformer models in JAX backend and on TPUs\n",
 "\n",
 "To experiment with a model using JAX, you can utilize Keras by setting its backend to JAX.\n",
-"By switching Keras\u2019s backend before model construction, and ensuring your environment is connected to a TPU runtime.\n",
-"Keras will then automatically leverage JAX\u2019s TPU support,\n",
+"By switching Keras’s backend before model construction, and ensuring your environment is connected to a TPU runtime.\n",
+"Keras will then automatically leverage JAX’s TPU support,\n",
 "allowing your model to train efficiently on TPU hardware without further code changes."
 ]
 },
 {
@@ -239,7 +268,7 @@
 "\n",
 "### Generation\n",
 "\n",
-"Here\u2019s an example using Llama: Loading a PyTorch Hugging Face transformer checkpoint into KerasHub and running it on the JAX backend."
+"Here’s an example using Llama: Loading a PyTorch Hugging Face transformer checkpoint into KerasHub and running it on the JAX backend."
 ]
 },
 {
@@ -277,43 +306,42 @@
 "colab_type": "text"
 },
 "source": [
-"### Changing precision\n",
+"## Comparing to Transformers\n",
+"\n",
+"In the following table, we have compiled a detailed comparison of HuggingFace's Transformers library with KerasHub:\n",
+"\n",
+"| Feature | HF Transformers | KerasHub |\n",
+"|----------------------------|-------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|\n",
+"| Frameworks supported | PyTorch | JAX, PyTorch, TensorFlow |\n",
+"| Trainer | HF Trainer | Keras `model.fit(...)` — supports nearly all features such as distributed training, learning rate scheduling, optimizer selection, etc. |\n",
+"| Tokenizers | `AutoTokenizer` | [KerasHub Tokenizers](https://keras.io/keras_hub/api/tokenizers/) |\n",
+"| Autoclass | `auto` keyword | KerasHub automatically [detects task-specific classes](https://x.com/fchollet/status/1922719664859381922) |\n",
+"| Model loading | `AutoModel.from_pretrained()` | `keras_hub.models.<Task>.from_preset()`<br><br>KerasHub uses task-specific classes (e.g., `CausalLM`, `Classifier`, `Backbone`) with a `from_preset()` method to load pretrained models, analogous to HuggingFace’s method.<br><br>Supports HF URLs, Kaggle URLs, and local directories |\n",
+"| Model saving | `model.save_pretrained()`<br>`tokenizer.save_pretrained()` | `model.save_to_preset()` — saves the model (including tokenizer/preprocessor) into a local directory (preset). All components needed for reloading or uploading are saved. |\n",
+"| Model uploading | Uploading weights to HF platform | [KerasHub Upload Guide](https://keras.io/keras_hub/guides/upload/)<br>[Keras on Hugging Face](https://huggingface.co/keras) |\n",
+"| Weights file sharding | Weights file sharding | Large model weights are sharded for efficient upload/download |\n",
+"| PEFT | Uses [HuggingFace PEFT](https://github.com/huggingface/peft) | Built-in LoRA support:<br>`backbone.enable_lora(rank=n)`<br>`backbone.save_lora_weights(filepath)`<br>`backbone.load_lora_weights(filepath)` |\n",
+"| Core model abstractions | `PreTrainedModel`, `AutoModel`, task-specific models | `Backbone`, `Preprocessor`, `Task` |\n",
+"| Model configs | `PretrainedConfig`: Base class for model configurations | Configurations stored as multiple JSON files in preset directory: `config.json`, `preprocessor.json`, `task.json`, `tokenizer.json`, etc. |\n",
+"| Preprocessing | Tokenizers/preprocessors often handled separately, then passed to the model | Built into task-specific models |\n",
+"| Mixed precision training | Via training arguments | Keras global policy setting |\n",
+"| Compatibility with SafeTensors | Default weights format | Of the 770k+ SafeTensors models on HF, those with a matching architecture in KerasHub can be loaded using `keras_hub.models.X.from_preset()` |\n",
 "\n",
-"You can adjust your model\u2019s precision by configuring it through `keras.config` as follows"
-]
-},
-{
-"cell_type": "code",
-"execution_count": 0,
-"metadata": {
-"colab_type": "code"
-},
-"outputs": [],
-"source": [
-"import keras\n",
-"\n",
-"keras.config.set_dtype_policy(\"bfloat16\")\n",
 "\n",
-"from keras_hub.models import Llama3CausalLM\n",
-"\n",
-"causal_lm = Llama3CausalLM.from_preset(\"hf://NousResearch/Hermes-2-Pro-Llama-3-8B\")"
-]
-},
-{
-"cell_type": "markdown",
-"metadata": {
-"colab_type": "text"
-},
-"source": [
 "Go try loading other model weights! You can find more options on HuggingFace\n",
 "and use them with `from_preset(\"hf://<namespace>/<model-name>\")`.\n",
 "\n",
 "Happy experimenting!"
 ]
+},
+{
+"cell_type": "markdown",
+"metadata": {},
+"source": []
 }
 ],
 "metadata": {
-"accelerator": "None",
+"accelerator": "GPU",
 "colab": {
 "collapsed_sections": [],
 "name": "hugging_face_keras_integration",
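The new table's "Model configs" row states that a KerasHub preset is a local directory of JSON files (`config.json`, `preprocessor.json`, `task.json`, `tokenizer.json`, etc.). A stdlib-only sketch of that layout; only the file names come from the table, the file contents here are invented purely for illustration:

```python
import json
import os
import tempfile

# Hypothetical preset directory mirroring the layout named in the table.
preset_dir = tempfile.mkdtemp()
for name in ["config.json", "preprocessor.json", "task.json", "tokenizer.json"]:
    with open(os.path.join(preset_dir, name), "w") as f:
        json.dump({"file": name}, f)  # placeholder contents, not a real config

print(sorted(os.listdir(preset_dir)))
# ['config.json', 'preprocessor.json', 'task.json', 'tokenizer.json']
```

A directory like this is what `model.save_to_preset()` produces and what `upload_preset` pushes to the Hub, per the table's "Model saving" and "Model uploading" rows.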
@@ -341,4 +369,4 @@
 },
 "nbformat": 4,
 "nbformat_minor": 0
-}
+}
