This Xcode project demonstrates OpenAI's Realtime API with WebRTC (Advanced Voice Mode). It is an iOS application built with SwiftUI, AVFoundation, and the WebRTC package. It supports the full Advanced Voice Mode capability set, including interrupting audio playback, sending text events manually, and configuring options such as the system message, realtime audio model, and voice.
A demo video, `ScreenRecording.mov`, shows the iOS application running on macOS.
## Requirements

- iOS 16.0 or later
- An OpenAI API key
## Installation

1. Clone the repository:

   ```bash
   git clone https://github.com/PallavAg/VoiceModeWebRTCSwift.git
   ```

2. Set up your API key:
   - Replace the `API_KEY` placeholder in the code with your OpenAI API key: `let API_KEY = "your_openai_api_key"`
   - Alternatively, you can specify the OpenAI API key in the app itself.

3. Run the app:
   - Go to the Signing & Capabilities section in Xcode and select your signing team.
   - Build and run the app on your iOS device, macOS device, or simulator.
## Usage

1. Start a connection:
   - Launch the app and enter your API key in Settings if it is not specified already.
   - Select your preferred AI model and voice, then press "Start Connection" to begin the conversation.

2. Interact:
   - Use the text input field or speak into the microphone to interact with the Realtime API.
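When text is typed rather than spoken, the Realtime API expects the client to send JSON events over the WebRTC data channel. As a rough sketch of what that looks like (the event shapes follow OpenAI's published Realtime API client events; the function name here is illustrative and not taken from this repository):

```swift
import Foundation

// Illustrative sketch: builds the two client events the Realtime API expects
// when the user submits a text message. Serialize each payload to Data and
// send it over the session's RTCDataChannel.
func textMessageEvents(for text: String) throws -> [Data] {
    // `conversation.item.create` appends the user's message to the conversation.
    let itemCreate: [String: Any] = [
        "type": "conversation.item.create",
        "item": [
            "type": "message",
            "role": "user",
            "content": [["type": "input_text", "text": text]]
        ]
    ]
    // A follow-up `response.create` asks the model to respond to the new item.
    let responseCreate: [String: Any] = ["type": "response.create"]
    return try [itemCreate, responseCreate].map {
        try JSONSerialization.data(withJSONObject: $0)
    }
}
```

Spoken input needs no such events: once the microphone track is attached to the peer connection, audio streams to the server continuously.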
## Architecture

- `ContentView`: The primary UI that orchestrates the conversation, input, and connection controls.
- `WebRTCManager`: Handles WebRTC connection setup, data channel communication, and audio processing.
- `OptionsView`: Allows customization of the API key, model, and voice settings.
## Troubleshooting

- Microphone permission: Ensure the app has microphone access in iOS Settings.
- Connection issues: Check that your API key is valid and that the OpenAI servers are reachable from your network.
## License

This project is licensed under the MIT License. See the `LICENSE` file for details.