Enterprise-grade information retrieval and search for AI applications
Azure AI Search is an information retrieval and search platform, designed to optimize retrieval-augmented generation (RAG) within Gen AI applications. Organizations can store, index and search their own data, delivering current information to AI models. Surface the most relevant information with cutting-edge technology including semantic ranking, vector and hybrid search.
Explore pricing options
Apply filters to customize pricing options to your needs.
Prices are estimates only and are not intended as actual price quotes. Actual pricing may vary depending on the type of agreement entered with Microsoft, date of purchase, and the currency exchange rate. Prices are calculated based on US dollars and converted using London closing spot rates that are captured in the two business days prior to the last business day of the previous month end. If the two business days prior to the end of the month fall on a bank holiday in major markets, the rate setting day is generally the day immediately preceding the two business days. This rate applies to all transactions during the upcoming month. Sign in to the Azure pricing calculator to see pricing based on your current program/offer with Microsoft. Contact an Azure sales specialist for more information on pricing or to request a price quote. See frequently asked questions about Azure pricing.
US government entities are eligible to purchase Azure Government services from a licensing solution provider with no upfront financial commitment, or directly through a pay-as-you-go online subscription.
Important—The price in R$ is merely a reference; this is an international transaction and the final price is subject to exchange rates and the inclusion of IOF taxes. An eNF will not be issued.
US government entities are eligible to purchase Azure Government services from a licensing solution provider with no upfront financial commitment, or directly through a pay-as-you-go online subscription.
Important—The price in R$ is merely a reference; this is an international transaction and the final price is subject to exchange rates and the inclusion of IOF taxes. An eNF will not be issued.
Service pricing
| Free | Basic | Standard S1 | Standard S2 | Standard S3 | Storage Optimized L1 | Storage Optimized L2 | |
|---|---|---|---|---|---|---|---|
| Storage1 | 50 MB | 15 GB (max 45 GB per service) | 160 GB (max 1.9 TB per service) | 512 GB (max 6 TB per service) | 1 TB (max 12 TB per service) | 2 TB (max 24 TB per service) | 4 TB (max 48 TB per service) | 
| Max indexes per service | 3 | 15 | 50 | 200 | 200 or 1,000/partition in high density mode | 10 | 10 | 
| Scale out limits | N/A | Up to 9 units per service (max 3 partition; max 3 replicas) | Up to 36 units per service (max 12 partition; max 12 replicas) | Up to 36 units per service (max 12 partition; max 12 replicas) | Up to 36 units per service (max 12 partition; max 12 replicas) up to 3 partitions in high density2 mode | Up to 36 units per service (max 12 partition; max 12 replicas) | Up to 36 units per service (max 12 partition; max 12 replicas) | 
| Price per SU (Search unit) | $- | $- | $- | $- | $- | $- | $- | 
1If you see lower storage in your service, you can upgrade.
2High density (HD) mode is an option available within the standard S3 service that allows a larger number of indexes to be created in a single service. For information on supported limits, refer here.
Agentic retrieval
Agentic retrieval incorporates conversation history and a bring-your-own-model (BYOM) integration into query planning, using an agentic framework to run and manage the RAG query pipeline.
| Product/Feature | Pricing (1M Tokens) | 
|---|---|
| Agentic retrieval | Agentic retrieval ranking input will be free to use starting May 19, 2025. Billing and associated charges will start in the summer of 2025. | 
Semantic ranker
Semantic ranker improves search relevance by finding content that is semantically similar to query terms. The service is only available for accounts on Basic, Standard tiers (S1, S2, and S3), and Storage-Optimized (L1 and L2) and has two pricing plans within those tiers. Use semantic ranker when you want to improve the quality of search results and optimize the user experience.
| Pricing | |
|---|---|
| Semantic ranker | First 1,000 requests per month free. $- per 1,000 additional requests. | 
Additional features
Custom Entity Lookup skill detects and labels documents containing user-defined words and phrases, and is available for all Basic, Standard, and Storage-Optimized tiers. It is ideal for defining and detecting specific entities in your data.
Document Cracking: Image Extraction retrieves content from a file during the built-in indexer document cracking phase and/or within the enrichment pipeline. Text extraction is free, while image extraction is billed during the initial document cracking step and/or when invoking the Document Extraction skill. Use this functionality when you have documents that contain images.
| Pricing Details | |
|---|---|
| Custom Entity Lookup skill | 0-1M text records $- per 1,000 text records 1M-3M text records $- per 1,000 text records 3M-10M text records $- per 1,000 text records 10M+ text records $- per 1,000 text records | 
| Document Cracking: Image Extraction | 0-1M images $- per 1,000 transactions 1M-5M images $- per 1,000 transactions 5M+ images $- per 1,000 transactions | 
Guidance for how to select the right SKU for your business
Every business has unique needs as it grows and evolves. Explore the guidance below to learn how to start with the right business model for your organization.
- 
            
            Prior to selecting a SKU, it's valuable to think through the requirements you have for your search service. The following are important questions to ask: - What are example queries your application users may write?
- What kind of information will they search for?
- Will users want to filter or sort through search results?
- How many indexes will you need?
- What is the size of your raw data files or blob storage?
- Will you need headroom for ingesting new documents/data in the future?
 
- 
            
            Azure AI Search is offered in combinable search "units" that vary by storage and throughput. Selecting the optimal pricing tier for your solution is based on the following factors: - Storage requirements: This refers to the amount of data you need to store in your search service. Each tier has a storage limit per partition, and you can add up to 12 partitions on a search service. It's important that your service has enough space for your data and has headroom to grow if you plan on ingesting more data.
 Pro Tip: The storage size of your search index will typically be smaller than the size of your raw data, particularly for files like PDFs and images. The best way to assess the storage size required for your search index is to index a representative sample of documents into your search index in the Azure portal with the Import Data Wizard. - Queries per Second (QPS): The throughput of your search service, often measured in QPS, will depend on your data and the types of queries users will issue. It's best to run performance tests to understand the throughput a particular tier can handle for your scenario but we have performance benchmarks that show the throughput different customer scenarios. Check out our performance benchmarking guidance to see sample customer scenarios.
 For additional guidance, please see our documentation: Choosing a Pricing Tier. 
- 
            
            Customers can further optimize their search experience with add-on services in the "Additional Features" table below. We also recommend customers read the following guidance on how to manage costs: 
- 
            
            For additional help, please see our options for one-on-one guidance. 
Azure pricing and purchasing options
 
                
            Connect with us directly
Get a walkthrough of Azure pricing. Understand pricing for your cloud solution, learn about cost optimization and request a custom proposal.
Talk to a sales specialistSee ways to purchase
Purchase Azure services through the Azure website, a Microsoft representative, or an Azure partner.
Explore your optionsAdditional resources
Azure AI Search
Learn more about Azure AI Search features and capabilities.
Pricing calculator
Estimate your expected monthly costs for using any combination of Azure products.
SLA
Review the Service Level Agreement for Azure AI Search.
Documentation
Review technical tutorials, videos, and more Azure AI Search resources.
Frequently asked questions
- 
            
            With Microsoft Azure AI Search, you are billed on a flat, predictable hourly rate based on the number of units that have been used during any given hour.
- 
            
            You are billed the last search unit count detected within an hour. If you start with two units, then scale to four units, and then scale back down to two units all within an hour, you will likely be charged for two units.
- 
            
            The stop button is meant to stop traffic to your service instance. As a result, your service is still running and will continue to be charged the hourly rate.
- 
            
            You are billed the flat rate for each hour the unit exists, regardless of usage or if the unit is active for less than an hour. For example, if you create a unit and delete it five minutes later, your bill will reflect a charge for one unit hour.
- 
            
            Azure AI Search units combine to provide additional throughput and storage. For example, to scale from 15 million documents to 30 million (additional partitions), a customer can purchase two units. To increase throughput (additional replicas), they can purchase two units. To increase both storage and throughput, a customer would need to purchase four units (2 replicas x 2 partitions = 4 search units).
- 
            
            Free is a free version of Azure AI Search designed to provide developers a sandbox to test features and implementations of Azure AI Search. It is not designed for production workloads. Basic, standard, and storage optimized are the go-to options for building applications that benefit from a self-managed search-as-a-service solution. Standard delivers storage and predictable throughput that scales with application needs. Storage Optimized editions offer significantly more storage at a reduced price per TB. For very high-demand applications, please contact azuresearch_contact@microsoft.com.
- 
            
            Azure AI Search is available in the new Microsoft Azure portal. First, you must sign up for an Azure subscription, then you can add an Azure AI Search account to your Azure subscription via the gallery in the preview portal. Get more information.
- 
            
            Agentic retrieval autonomously decides the best sources and methods for retrieving information, enhancing response quality through reasoning and planning. Semantic ranker improves search relevance by understanding and leveraging the semantics of user queries. Learn more here.
Talk to a sales specialist for a walk-through of Azure pricing. Understand pricing for your cloud solution.
Get free cloud services and a $200 credit to explore Azure for 30 days.
