
Commit aec95c6

Authored by NorbertKlockiewicz, chmjkb, and Mateusz Kopciński
docs: Documentation for style transfer and bindings (#57)
## Description

Style transfer and bindings docs pages

### Type of change

- [ ] Bug fix (non-breaking change which fixes an issue)
- [ ] New feature (non-breaking change which adds functionality)
- [ ] Breaking change (fix or feature that would cause existing functionality to not work as expected)
- [x] Documentation update (improves or adds clarity to existing documentation)

### Tested on

- [ ] iOS
- [ ] Android

### Checklist

- [ ] I have performed a self-review of my code
- [ ] I have commented my code, particularly in hard-to-understand areas
- [ ] I have updated the documentation accordingly
- [ ] My changes generate no new warnings

Co-authored-by: chmjkb <[email protected]>
Co-authored-by: Mateusz Kopciński <[email protected]>
1 parent bbf6aee commit aec95c6

11 files changed: +375 -45 lines
@@ -0,0 +1,7 @@
{
  "label": "Computer Vision",
  "position": 3,
  "link": {
    "type": "generated-index"
  }
}

@@ -0,0 +1,100 @@
---
title: useObjectDetection
sidebar_position: 1
---

`useObjectDetection` is a hook that lets you seamlessly integrate object detection into your React Native application. Currently, the SSDLite320Large model with the MobileNetV3 backbone is supported.

## Reference

```jsx
import { useObjectDetection } from 'react-native-executorch';

function App() {
  const ssdlite = useObjectDetection({
    modelSource: require('./assets/ssdlite320large_mobilenetv3.pte'),
  });

  ...
  for (const detection of await ssdlite.forward('https://url-to-image.jpg')) {
    console.log('Bounding box: ', detection.bbox);
    console.log('Label: ', detection.label);
    console.log('Score: ', detection.score);
  }
  ...
}
```

<details>
<summary>Type definitions</summary>

```typescript
interface Bbox {
  x1: number;
  y1: number;
  x2: number;
  y2: number;
}

interface Detection {
  bbox: Bbox;
  label: keyof typeof CocoLabels;
  score: number;
}
```

</details>

### Arguments

**`modelSource`**

A string that specifies the path to the model file. You can download the model from our HuggingFace repository. For SSDLite, you can add it to your assets directory and use `require()`. If you prefer to download the model at runtime instead of bundling it, you can use the constants that we ship with the library.

### Returns

The hook returns an object with the following properties:

| Field          | Type                                      | Description                                                                                    |
| -------------- | ----------------------------------------- | ---------------------------------------------------------------------------------------------- |
| `forward`      | `(input: string) => Promise<Detection[]>` | Function that accepts an image and returns an array of `Detection` objects                     |
| `error`        | <code>string &#124; null</code>           | Contains the error message if the model failed to load or failed during the generation process |
| `isGenerating` | `boolean`                                 | Indicates whether the model is processing a response                                           |
| `isReady`      | `boolean`                                 | Indicates whether the model has properly loaded and is ready for inference                     |

### Detection object

The detection object is specified as follows:

```typescript
interface Bbox {
  x1: number;
  y1: number;
  x2: number;
  y2: number;
}

interface Detection {
  bbox: Bbox;
  label: keyof typeof CocoLabels;
  score: number;
}
```

The `bbox` property contains the bounding box of the detected object, represented by two points: the bottom-left corner (x1, y1) and the top-right corner (x2, y2). The `label` property contains the name of the detected object, which is one of the `CocoLabels`. The `score` is the confidence score of the detected object.
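
If you need the box as a top-left origin with width and height (for example, to position an overlay view), converting the two-corner representation is straightforward. Below is a minimal sketch; the `toRect` helper and the assumption that y grows downwards in image coordinates are ours, not part of the library:

```typescript
// Hypothetical helper (not part of react-native-executorch):
// converts a two-corner Bbox into a top-left/width/height rectangle,
// assuming image coordinates where y grows downwards.
function toRect(bbox: Bbox) {
  return {
    x: Math.min(bbox.x1, bbox.x2),
    y: Math.min(bbox.y1, bbox.y2),
    width: Math.abs(bbox.x2 - bbox.x1),
    height: Math.abs(bbox.y2 - bbox.y1),
  };
}
```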

### Running the model

To run the model, use the `forward` method. It accepts one argument: the image, which can be a remote URL, a local file, or a base64-encoded image. The function returns a promise that resolves to an array of `Detection` objects, each containing the coordinates of the bounding box, the label of the detected object, and the confidence score. For more information, refer to the reference above or the example below.

### End to end example

```tsx
import {
  useObjectDetection,
  SSDLITE320LARGE_MOBILENETV3_WEIGHTS,
} from 'react-native-executorch';

function App() {
  const ssdlite = useObjectDetection({
    modelSource: SSDLITE320LARGE_MOBILENETV3_WEIGHTS, // You can also use require('...')
  });

  const runModel = async () => {
    const detections = await ssdlite.forward('https://url-to-image.jpg');
    for (const detection of detections) {
      console.log('Bounding box: ', detection.bbox); // { x1, y1, x2, y2 }
      console.log('Label: ', detection.label);
      console.log('Score: ', detection.score);
    }
  };
}
```

### Benchmarks

TODO

@@ -0,0 +1,94 @@
---
title: useStyleTransfer
sidebar_position: 1
---

Style transfer is a technique used in computer graphics and machine learning where the visual style of one image is applied to the content of another. This is achieved using algorithms that manipulate data from both images, typically with the aid of a neural network. The result is a new image that combines the artistic elements of one picture with the structural details of another, effectively merging art with traditional imagery. React Native ExecuTorch offers a dedicated hook, `useStyleTransfer`, for this task. However, before you start, you'll need to obtain an ExecuTorch-compatible model binary.

:::caution
It is recommended to use the models provided by us, which are available in our [HuggingFace repository](https://huggingface.co/software-mansion/react-native-executorch-style-transfer-candy). You can also use the [constants](https://github.com/software-mansion/react-native-executorch/tree/main/src/constants/modelUrls.ts) shipped with our library.
:::

## Reference

```typescript
import {
  useStyleTransfer,
  STYLE_TRANSFER_CANDY,
} from 'react-native-executorch';

const model = useStyleTransfer({
  modelSource: STYLE_TRANSFER_CANDY,
});

const imageUri = 'file:///Users/.../cute_cat.png';

try {
  const generatedImageUrl = await model.forward(imageUri);
} catch (error) {
  console.error(error);
}
```

<details>
<summary>Type definitions</summary>

```typescript
interface StyleTransferModule {
  error: string | null;
  isReady: boolean;
  isGenerating: boolean;
  forward: (input: string) => Promise<string>;
}
```

</details>

### Arguments

**`modelSource`**
A string that specifies the location of the model binary. For more information, take a look at the [loading models](../fundamentals/loading-models.md) page.

### Returns

| Field          | Type                                 | Description                                                                                              |
| -------------- | ------------------------------------ | -------------------------------------------------------------------------------------------------------- |
| `forward`      | `(input: string) => Promise<string>` | Executes the model's forward pass, where `input` can be a fetchable resource or a Base64-encoded string. |
| `error`        | <code>string &#124; null</code>      | Contains the error message if the model failed to load.                                                  |
| `isGenerating` | `boolean`                            | Indicates whether the model is currently processing an inference.                                        |
| `isReady`      | `boolean`                            | Indicates whether the model has successfully loaded and is ready for inference.                          |

## Running the model

To run the model, use the `forward` method. It accepts one argument: the image, which can be a remote URL, a local file URI, or a base64-encoded image. The function returns a promise that resolves to the URL of the generated image, or rejects with an error.

:::info
Images from external sources and the generated image are stored in your application's temporary directory.
:::

## Example

```typescript
function App() {
  const model = useStyleTransfer({
    modelSource: STYLE_TRANSFER_CANDY,
  });

  ...
  const imageUri = 'file:///Users/.../cute_cat.png';

  try {
    const generatedImageUrl = await model.forward(imageUri);
  } catch (error) {
    console.error(error);
  }
  ...
}
```
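
Because `forward` resolves to a URL pointing at the generated image, you can pass the result straight to an image view. A minimal sketch, assuming React Native's built-in `Image`, `Button`, and `View` components plus local state; the component below is our addition, not part of the library:

```tsx
import { useState } from 'react';
import { Button, Image, View } from 'react-native';
import { useStyleTransfer, STYLE_TRANSFER_CANDY } from 'react-native-executorch';

function StylizedCat() {
  const model = useStyleTransfer({ modelSource: STYLE_TRANSFER_CANDY });
  const [uri, setUri] = useState<string | null>(null);

  const run = async () => {
    try {
      // forward resolves to a URL of the generated image
      setUri(await model.forward('file:///Users/.../cute_cat.png'));
    } catch (error) {
      console.error(error);
    }
  };

  return (
    <View>
      <Button title="Stylize" onPress={run} disabled={!model.isReady || model.isGenerating} />
      {uri && <Image source={{ uri }} style={{ width: 256, height: 256 }} />}
    </View>
  );
}
```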

## Supported Models

- [Candy](https://github.com/pytorch/examples/tree/main/fast_neural_style)
- [Mosaic](https://github.com/pytorch/examples/tree/main/fast_neural_style)
- [Udnie](https://github.com/pytorch/examples/tree/main/fast_neural_style)
- [Rain princess](https://github.com/pytorch/examples/tree/main/fast_neural_style)

docs/docs/fundamentals/getting-started.mdx (+2 -2)

@@ -45,11 +45,11 @@ If you plan on adding your models to the assets instead of fetching them from a

 This allows us to use binaries, such as exported models or tokenizers for LLMs.

-:::caution[Caution]
+:::caution
 When using Expo, please note that you need to use a custom development build of your app, not the standard Expo Go app. This is because we rely on native modules, which Expo Go doesn’t support.
 :::

-:::info[Info]
+:::info
 Because we are using ExecuTorch under the hood, you won't be able to build an iOS app for release with a simulator selected as the target device. Make sure to test release builds on real devices.
 :::

(+45)

@@ -0,0 +1,45 @@
---
title: Loading models
sidebar_position: 1
---

There are three different methods available for loading model files, depending on their size and location.

**1. Load from React Native assets folder (for files < 512MB):**

```typescript
modelSource: require('../assets/llama3_2.pte');
```
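
For `require()` to pick up a `.pte` binary, the bundler must treat that extension as an asset. Below is a minimal `metro.config.js` sketch for an Expo project; the exact config is an assumption based on how Metro handles custom asset extensions, not something this page prescribes:

```js
// metro.config.js — register model/tokenizer binaries as bundleable assets
// (assumed setup; adjust to your project's existing Metro config)
const { getDefaultConfig } = require('expo/metro-config');

const config = getDefaultConfig(__dirname);
config.resolver.assetExts.push('pte', 'bin');

module.exports = config;
```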

**2. Load from a remote URL:**

For files larger than 512MB, or when you want to keep the app size smaller, you can load the model from a remote URL (e.g. HuggingFace).

```typescript
modelSource: 'https://.../llama3_2.pte';
```

**3. Load from the local file system:**

If you prefer to delegate the process of obtaining and loading the model and tokenizer files to the user, you can use the following method:

```typescript
modelSource: 'file:///var/mobile/.../llama3_2.pte',
```

:::info
The downloaded files are stored in the documents directory of your application.
:::

## Example

The following code snippet demonstrates how to load model and tokenizer files using the `useLLM` hook:

```typescript
import { useLLM } from 'react-native-executorch';

const llama = useLLM({
  modelSource: 'https://.../llama3_2.pte',
  tokenizer: require('../assets/tokenizer.bin'),
});
```

docs/docs/guides/_category_.json → docs/docs/llms/_category_.json (+1 -1)

@@ -1,5 +1,5 @@
 {
-  "label": "Guides",
+  "label": "LLMs",
   "position": 2,
   "link": {
     "type": "generated-index"
File renamed without changes.

docs/docs/guides/running-llms.md → docs/docs/llms/running-llms.md (+4 -34)
@@ -6,7 +6,7 @@ sidebar_position: 1

 React Native ExecuTorch supports Llama 3.2 models, including quantized versions. Before getting started, you’ll need to obtain the .pte binary—a serialized model—and the tokenizer. There are various ways to accomplish this:

 - For your convenience, it's best to use the models exported by us; you can get them from our HuggingFace repository. You can also use [constants](https://github.com/software-mansion/react-native-executorch/tree/main/src/constants/modelUrls.ts) shipped with our library.
-- If you want to export the model yourself, you can use a Docker image that we've prepared. To see how it works, check out [exporting Llama](./exporting-llama.mdx)
+- If you want to export the model yourself, you can use a Docker image that we've prepared. To see how it works, check out [exporting Llama](./exporting-llama)
 - Follow the official [tutorial](https://github.com/pytorch/executorch/blob/fe20be98c/examples/demo-apps/android/LlamaDemo/docs/delegates/xnnpack_README.md) made by the ExecuTorch team to build the model and tokenizer yourself

 ## Initializing
@@ -25,17 +25,17 @@ const llama = useLLM({

 The code snippet above fetches the model from the specified URL, loads it into memory, and returns an object with various methods and properties for controlling the model. You can monitor the loading progress by checking the `llama.downloadProgress` and `llama.isReady` properties, and if anything goes wrong, the `llama.error` property will contain the error message.

-:::danger[Danger]
+:::danger
 Lower-end devices might not be able to fit LLMs into memory. We recommend using quantized models to reduce the memory footprint.
 :::

-:::caution[Caution]
+:::caution
 Given computational constraints, our architecture is designed to support only one instance of the model runner at a time. Consequently, you can have only one active component leveraging `useLLM` concurrently.
 :::

 ### Arguments

-**`modelSource`** - A string that specifies the location of the model binary. For more information, take a look at the [loading models](#loading-models) section.
+**`modelSource`** - A string that specifies the location of the model binary. For more information, take a look at the [loading models](../fundamentals/loading-models.md) section.

 **`tokenizer`** - URL to the binary file which contains the tokenizer
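
Since `downloadProgress` runs from 0 to 1 and `isReady` flips once the model has loaded, a loading gate falls out naturally. A minimal sketch using only the fields documented in the table below; the component itself is our addition, not part of the library:

```tsx
import { Text } from 'react-native';
import { useLLM } from 'react-native-executorch';

function ModelGate() {
  const llama = useLLM({
    modelSource: 'https://.../llama3_2.pte',
    tokenizer: require('../assets/tokenizer.bin'),
  });

  if (llama.error) return <Text>Error: {llama.error}</Text>;
  if (!llama.isReady) {
    // downloadProgress is reported as a value between 0 and 1
    return <Text>Loading model: {Math.round(llama.downloadProgress * 100)}%</Text>;
  }
  return <Text>Model ready</Text>;
}
```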

@@ -55,36 +55,6 @@ Given computational constraints, our architecture is designed to support only on
 | `isReady` | `boolean` | Indicates whether the model is ready |
 | `downloadProgress` | `number` | Represents the download progress as a value between 0 and 1, indicating the extent of the model file retrieval. |

-### Loading models
-
-There are three different methods available for loading the model and tokenizer files, depending on their size and location.
-
-**1. Load from React-Native assets folder (For Files < **512MB**)**
-
-```typescript
-modelSource: require('../assets/llama3_2.pte');
-```
-
-**2. Load from Remote URL:**
-
-For files larger than 512MB or when you want to keep size of the app smaller, you can load the model from a remote URL (e.g. HuggingFace).
-
-```typescript
-modelSource: 'https://.../llama3_2.pte';
-```
-
-**3. Load from local file system:**
-
-If you prefer to delegate the process of obtaining and loading model and tokenizer files to the user, you can use the following method:
-
-```typescript
-modelSource: 'file:://var/mobile/.../llama3_2.pte',
-```
-
-:::info[Info]
-The downloaded files are stored in documents directory of your application.
-:::
-
 ### Sending a message

 In order to send a message to the model, one can use the following code:
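
The snippet that follows that sentence is cut off in this diff. As a rough, hypothetical sketch only — the `generate` method name and its signature are our assumption about the hook's API, not confirmed by anything on this page:

```tsx
// Hypothetical — the method name below is assumed, since the diff is truncated here.
const handleSend = async (message: string) => {
  try {
    await llama.generate(message);
  } catch (error) {
    console.error(error);
  }
};
```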

docs/docs/module-api/_category_.json (+7)

@@ -0,0 +1,7 @@
{
  "label": "Module API",
  "position": 4,
  "link": {
    "type": "generated-index"
  }
}
