Get started with Azure OpenAI service

Azure OpenAI Service brings the generative AI models developed by OpenAI to the Azure platform, enabling you to develop powerful AI solutions that benefit from the security, scalability, and integration of services provided by the Azure cloud platform. In this exercise, you’ll learn how to get started with Azure OpenAI by provisioning the service as an Azure resource and using Azure AI Foundry to deploy and explore generative AI models.

In the scenario for this exercise, you will perform the role of a software developer who has been tasked to implement an AI agent that can use generative AI to help a marketing organization improve its effectiveness at reaching customers and advertising new products. The techniques used in the exercise can be applied to any scenario where an organization wants to use generative AI models to help employees be more effective and productive.

This exercise takes approximately 30 minutes.

Provision an Azure OpenAI resource

If you don’t already have one, provision an Azure OpenAI resource in your Azure subscription.

Sign into the Azure portal at https://portal.azure.com.
Create an Azure OpenAI resource with the following settings:
- Subscription: Select an Azure subscription that has been approved for access to the Azure OpenAI service
- Resource group: Choose or create a resource group
- Region: Make a random choice from any of the following regions*
  - East US
  - East US 2
  - North Central US
  - South Central US
  - Sweden Central
  - West US
  - West US 3
- Name: A unique name of your choice
- Pricing tier: Standard S0
* Azure OpenAI resources are constrained by regional quotas. The listed regions include default quota for the model type(s) used in this exercise. Randomly choosing a region reduces the risk of a single region reaching its quota limit in scenarios where you are sharing a subscription with other users. In the event of a quota limit being reached later in the exercise, there’s a possibility you may need to create another resource in a different region.
Wait for deployment to complete. Then go to the deployed Azure OpenAI resource in the Azure portal.

Deploy a model

Azure provides a web-based portal named Azure AI Foundry portal, that you can use to deploy, manage, and explore models. You’ll start your exploration of Azure OpenAI by using Azure AI Foundry portal to deploy a model.

Note: As you use Azure AI Foundry portal, message boxes suggesting tasks for you to perform may be displayed. You can close these and follow the steps in this exercise.

In the Azure portal, on the Overview page for your Azure OpenAI resource, scroll down to the Get Started section and select the button to go to AI Foundry portal (previously AI Studio).
In Azure AI Foundry portal, in the pane on the left, select the Deployments page and view your existing model deployments. If you don’t already have one, create a new deployment of the gpt-4o model with the following settings:
- Deployment name: A unique name of your choice
- Model: gpt-4o
- Model version: Use default version
- Deployment type: Standard
- Tokens per minute rate limit: 5K*
- Content filter: Default
- Enable dynamic quota: Disabled
* A rate limit of 5,000 tokens per minute is more than adequate to complete this exercise while leaving capacity for other people using the same subscription.

Use the Chat playground

Now that you’ve deployed a model, you can use it to generate responses based on natural language prompts. The Chat playground in Azure AI Foundry portal provides a chatbot interface for GPT 4 and higher models.

Note: The Chat playground uses the ChatCompletions API rather than the older Completions API that is used by the Completions playground. The Completions playground is provided for compatibility with older models.

In the Playground section, select the Chat page. The Chat playground page consists of a row of buttons and two main panels (which may be arranged right-to-left horizontally, or top-to-bottom vertically depending on your screen resolution):
- Configuration - used to select your deployment, define system message, and set parameters for interacting with your deployment.
- Chat session - used to submit chat messages and view responses.
Under Deployments, ensure that your gpt-4o model deployment is selected.
Review the default System message, which should be You are an AI assistant that helps people find information. The system message is included in prompts submitted to the model, and provides context for the model’s responses; setting expectations about how an AI agent based on the model should interact with the user.
In the Chat session panel, enter the user query How can I use generative AI to help me market a new product?

Note: You may receive a response that the API deployment is not yet ready. If so, wait for a few minutes and try again.
Review the response, noting that the model has generated a cohesive natural language answer that is relevant to the query with which it was prompted.
Enter the user query What skills do I need if I want to develop a solution to accomplish this?.
Review the response, noting that the chat session has retained the conversational context (so “this” is interpreted as a generative AI solution for marketing). This contextualization is achieved by including the recent conversation history in each successive prompt submission, so the prompt sent to the model for the second query included the original query and response as well as the new user input.
In the Chat session panel toolbar, select Clear chat and confirm that you want to restart the chat session.
Enter the query Can you help me find resources to learn those skills? and review the response, which should be a valid natural language answer, but since the previous chat history has been lost, the answer is likely to be about finding generic skilling resources rather than being related to the specific skills needed to build a generative AI marketing solution.

Experiment with system messages, prompts, and few-shot examples

So far, you’ve engaged in a chat conversation with your model based on the default system message. You can customize the system setup to have more control over the kinds of responses generated by your model.

In the main toolbar, select the Prompt samples, and use the Marketing Writing Assistant prompt template.
Review the new system message, which describes how an AI agent should use the model to respond.
In the Chat session panel, enter the user query Create an advertisement for a new scrubbing brush.
Review the response, which should include advertising copy for a scrubbing brush. The copy may be quite extensive and creative.

In a real scenario, a marketing professional would likely already know the name of the scrubbing brush product as well as have some ideas about key features that should be highlighted in an advert. To get the most useful results from a generative AI model, users need to design their prompts to include as much pertinent information as possible.
Enter the prompt Revise the advertisement for a scrubbing brush named "Scrubadub 2000", which is made of carbon fiber and reduces cleaning times by half compared to ordinary scrubbing brushes.
Review the response, which should take into account the additional information you provided about the scrubbing brush product.

The response should now be more useful, but to have even more control over the output from the model, you can provide one or more few-shot examples on which responses should be based.

Under the System message text box, expand the dropdown for Add section and select Examples. Then type the following message and response in the designated boxes:

User:

 Write an advertisement for the lightweight "Ultramop" mop, which uses patented absorbent materials to clean floors.

Assistant:

 Welcome to the future of cleaning!
    
 The Ultramop makes light work of even the dirtiest of floors. Thanks to its patented absorbent materials, it ensures a brilliant shine. Just look at these features:
 - Lightweight construction, making it easy to use.
 - High absorbency, enabling you to apply lots of clean soapy water to the floor.
 - Great low price.
    
 Check out this and other products on our website at www.contoso.com.

Use the Apply changes button to save the examples and start a new session.
In the Chat session section, enter the user query Create an advertisement for the Scrubadub 2000 - a new scrubbing brush made of carbon fiber that reduces cleaning time by half.
Review the response, which should be a new advert for the “Scrubadub 2000” that is modeled on the “Ultramop” example provided in the system setup.

Experiment with parameters

You’ve explored how the system message, examples, and prompts can help refine the responses returned by the model. You can also use parameters to control model behavior.

In the Configuration panel, select the Parameters tab and set the following parameter values:
- Max response: 1000
- Temperature: 1
In the Chat session section, use the Clear chat button to reset the chat session. Then enter the user query Create an advertisement for a cleaning sponge and review the response. The resulting advertisement copy should include a maximum of 1000 text tokens, and include some creative elements - for example, the model may have invented a product name for the sponge and made some claims about its features.
Use the Clear chat button to reset the chat session again, and then re-enter the same query as before (Create an advertisement for a cleaning sponge) and review the response. The response may be different from the previous response.
In the Configuration panel, on the Parameters tab, change the Temperature parameter value to 0.
In the Chat session section, use the Clear chat button to reset the chat session again, and then re-enter the same query as before (Create an advertisement for a cleaning sponge) and review the response. This time, the response may not be quite so creative.
Use the Clear chat button to reset the chat session one more time, and then re-enter the same query as before (Create an advertisement for a cleaning sponge) and review the response; which should be very similar (if not identical) to the previous response.

The Temperature parameter controls the degree to which the model can be creative in its generation of a response. A low value results in a consistent response with little random variation, while a high value encourages the model to add creative elements its output; which may affect the accuracy and realism of the response.

Deploy your model to a web app

Now that you’ve explored some of the capabilities of a generative AI model in the Azure AI Foundry playground, you can deploy an Azure web app to provide a basic AI agent interface through which users can chat with the model.

Note: For some users, deploying to the web app cannot be deployed due to a bug in the template in the studio. If that’s the case, skip this section.

At the top right of the Chat playground page, in the Deploy to menu, select A new web app.
In the Deploy to a web app dialog box, create a new web app with the following settings:
- Name: A unique name
- Subscription: Your Azure subscription
- Resource group: The resource group in which you provisioned your Azure OpenAI resource
- Locations: The region where you provisioned your Azure OpenAI resource
- Pricing plan: Free (F1) - If this is not available, select Basic (B1)
- Enable chat history in the web app: Unselected
- I acknowledge that web apps will incur usage to my account: Selected
Deploy the new web app and wait for deployment to complete (which may take 10 minutes or so)
After your web app has deployed successfully, use the button at the top right of the Chat playground page to launch the web app. The app may take a few minutes to launch. If prompted, accept the permissions request.

In the web app, enter the following chat message:

 Write an advertisement for the new "WonderWipe" cloth that attracts dust particulates and can be used to clean any household surface.

Review the response.

Note: You deployed the model to a web app, but this deployment doesn’t include the system settings and parameters you set in the playground; so the response may not reflect the examples you specified in the playground. In a real scenario, you would add logic to your application to modify the prompt so that it includes the appropriate contextual data for the kinds of response you want to generate. This kind of customization is beyond the scope of this introductory-level exercise, but you can learn about prompt engineering techniques and Azure OpenAI APIs in other exercises and product documentation.
When you have finished experimenting with your model in the web app, close the web app tab in your browser to return to Azure AI Foundry portal.

Clean up

When you’re done with your Azure OpenAI resource, remember to delete the deployment or the entire resource in the Azure portal at https://portal.azure.com.