Commit 37a5332

Restructure repository to prepare for dotnet samples
1 parent 4458fa3 commit 37a5332

41 files changed, +4824 -5156 lines changed

README.md

+25 -10
@@ -28,6 +28,9 @@ The techniques demonstrated take advantage of various capabilities from each ser
 ## Contents

 - [Samples](#samples)
+- [Document Classification](#document-classification)
+- [Document Redaction](#document-redaction)
+- [Document Extraction](#document-extraction)
 - [Getting Started](#getting-started)
 - [Setup on GitHub Codespaces](#setup-on-github-codespaces)
 - [Setup on Local](#setup-on-local)
@@ -40,16 +43,28 @@ The techniques demonstrated take advantage of various capabilities from each ser
 > [!NOTE]
 > All data extraction samples provide both an accuracy and confidence score for the extracted data. The accuracy score is calculated based on the similarity between the extracted data and the ground truth data. The confidence score is calculated based on OCR analysis confidence and `logprobs` in Azure OpenAI requests.

-| Sample | Description | Example Use Cases |
-| --------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
-| [Data Extraction - Azure AI Document Intelligence + Azure OpenAI GPT-4o](./samples/extraction/text-based/document-intelligence-openai.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and Azure OpenAI GPT models to extract structured data from documents. | Predominantly text-based documents such as invoices, receipts, and forms. |
-| [Data Extraction - Azure AI Document Intelligence + Phi](./samples/extraction/text-based/document-intelligence-phi.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and Microsoft's Phi small language models to extract structured data from documents. | Predominantly text-based documents such as invoices, receipts, and forms. |
-| [Data Extraction - Azure OpenAI GPT-4o with Vision](./samples/extraction/vision-based/openai.ipynb) | Demonstrates how to use Azure OpenAI GPT-4o and GPT-4o-mini models to extract structured data from documents using their built-in vision capabilities. | Complex documents with a mix of text and images, including diagrams, signatures, selection marks, etc. such as reports and contracts. |
-| [Data Extraction - Comprehensive Azure AI Document Intelligence + Azure OpenAI GPT-4o with Vision](./samples/extraction/vision-based/comprehensive.ipynb) | Demonstrates how to improve the accuracy and confidence in extracting structured data from documents by combining Azure AI Document Intelligence and Azure OpenAI GPT-4o models with vision capabilities. | Any structured or unstructured document type. |
-| [Classification - Azure OpenAI GPT-4o with Vision](./samples/classification/openai.ipynb) | Demonstrates how to use Azure OpenAI GPT-4o and GPT-4o-mini models to classify documents using their built-in vision capabilities. | Processing multiple documents types or documents with varying purposes, such as contracts, legal documents, and emails. |
-| [Classification - Azure AI Document Intelligence + Embeddings](./samples/classification/document-intelligence-embeddings.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and embeddings models to classify documents based on their content. | Processing multiple documents types or documents with varying purposes, such as contracts, legal documents, and emails. |
-| [Redaction - Azure AI Language PII Native Document Analysis](./samples/redaction/language-pii-redaction.ipynb) | Demonstrates how to use Azure AI Language Native Document Analysis features to redact personally identifiable information (PII) from documents. | Redacting sensitive information from documents, such as names, addresses, and phone numbers. |
-| [Redaction - Azure AI Document Intelligence + Azure OpenAI GPT-4o](./samples/redaction/document-intelligence-openai-redaction.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and Azure OpenAI GPT models to redact sensitive information from documents using natural language instruction to determine redaction areas. | Need for more complex redaction rules, such as redacting based on context or relationships. Also works well for redacting PII, including names, addresses, and phone numbers. |
+### Document Classification
+
+| Sample | Description | Example Use Cases |
+| --------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------- |
+| [Classification - Azure OpenAI GPT-4o with Vision](./samples/python/classification/document-classification-gpt-vision.ipynb) | Demonstrates how to use Azure OpenAI GPT-4o and GPT-4o-mini models to classify documents using their built-in vision capabilities. | Processing multiple documents types or documents with varying purposes, such as contracts, legal documents, and emails. |
+| [Classification - Azure AI Document Intelligence + Embeddings](./samples/python/classification/document-classification-text-embeddings.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and embeddings models to classify documents based on their content. | Processing multiple documents types or documents with varying purposes, such as contracts, legal documents, and emails. |
+
+### Document Redaction
+
+| Sample | Description | Example Use Cases |
+| ------------------------------------------------------------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
+| [Redaction - Azure AI Document Intelligence + Azure OpenAI GPT-4o](./samples/python/redaction/document-redaction-gpt.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and Azure OpenAI GPT models to redact sensitive information from documents using natural language instruction to determine redaction areas. | Need for more complex redaction rules, such as redacting based on context or relationships. Also works well for redacting PII, including names, addresses, and phone numbers. |
+| [Redaction - Azure AI Language PII Native Document Analysis](./samples/python/redaction/document-redaction-language-native-document.ipynb) | Demonstrates how to use Azure AI Language Native Document Analysis features to redact personally identifiable information (PII) from documents. | Redacting sensitive information from documents, such as names, addresses, and phone numbers. |
+
+### Document Extraction
+
+| Sample | Description | Example Use Cases |
+| ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | ------------------------------------------------------------------------------------------------------------------------------------- |
+| [Data Extraction - Azure AI Document Intelligence + Azure OpenAI GPT-4o](./samples/python/extraction/text/document-extraction-gpt.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and Azure OpenAI GPT models to extract structured data from documents. | Predominantly text-based documents such as invoices, receipts, and forms. |
+| [Data Extraction - Azure AI Document Intelligence + Phi](./samples/python/extraction/text/document-extraction-phi.ipynb) | Demonstrates how to use Azure AI Document Intelligence pre-built layout and Microsoft's Phi small language models to extract structured data from documents. | Predominantly text-based documents such as invoices, receipts, and forms. |
+| [Data Extraction - Azure OpenAI GPT-4o with Vision](./samples/python/extraction/vision/document-extraction-gpt-vision.ipynb) | Demonstrates how to use Azure OpenAI GPT-4o and GPT-4o-mini models to extract structured data from documents using their built-in vision capabilities. | Complex documents with a mix of text and images, including diagrams, signatures, selection marks, etc. such as reports and contracts. |
+| [Data Extraction - Comprehensive Azure AI Document Intelligence + Azure OpenAI GPT-4o with Vision](./samples/python/extraction/multimodal/document-extraction-gpt-text-and-vision.ipynb) | Demonstrates how to improve the accuracy and confidence in extracting structured data from documents by combining Azure AI Document Intelligence and Azure OpenAI GPT-4o models with vision capabilities. | Any structured or unstructured document type. |

 ## Getting Started

samples/classification/README.md

-8
This file was deleted.

samples/extraction/README.md

-13
This file was deleted.
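
The README.md change in this commit moves every sample link from `./samples/<area>/...` to `./samples/python/<area>/...` and renames the notebooks. The following is a minimal sketch of how Markdown links elsewhere could be updated to the new layout. It assumes the old-to-new pairing can be read off the matching link titles in the removed and added README table rows; the `RENAMED` table and the `rewrite_links` helper are hypothetical names for illustration and are not part of this commit.

```python
import re

# Old -> new notebook paths, paired by matching link titles in the README diff.
# Assumption: each title refers to the same notebook before and after the move.
RENAMED = {
    "./samples/extraction/text-based/document-intelligence-openai.ipynb":
        "./samples/python/extraction/text/document-extraction-gpt.ipynb",
    "./samples/extraction/text-based/document-intelligence-phi.ipynb":
        "./samples/python/extraction/text/document-extraction-phi.ipynb",
    "./samples/extraction/vision-based/openai.ipynb":
        "./samples/python/extraction/vision/document-extraction-gpt-vision.ipynb",
    "./samples/extraction/vision-based/comprehensive.ipynb":
        "./samples/python/extraction/multimodal/document-extraction-gpt-text-and-vision.ipynb",
    "./samples/classification/openai.ipynb":
        "./samples/python/classification/document-classification-gpt-vision.ipynb",
    "./samples/classification/document-intelligence-embeddings.ipynb":
        "./samples/python/classification/document-classification-text-embeddings.ipynb",
    "./samples/redaction/language-pii-redaction.ipynb":
        "./samples/python/redaction/document-redaction-language-native-document.ipynb",
    "./samples/redaction/document-intelligence-openai-redaction.ipynb":
        "./samples/python/redaction/document-redaction-gpt.ipynb",
}

# Matches the "](target)" part of an inline Markdown link.
LINK_TARGET = re.compile(r"\]\(([^)\s]+)\)")


def rewrite_links(markdown: str) -> str:
    """Replace old sample paths in inline Markdown links with their new paths."""
    def swap(match: re.Match) -> str:
        target = match.group(1)
        # Targets that are not in the table (anchors, URLs) are left untouched.
        return f"]({RENAMED.get(target, target)})"

    return LINK_TARGET.sub(swap, markdown)
```

Links whose target is not in the table, such as in-page anchors like `#samples`, pass through unchanged, so the helper can be run over a whole file.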
