Talking AI 🔊

Overview

Talking AI is a sophisticated Chrome Extension designed to bridge the gap between static web content and interactive learning. Built with React 19 and TypeScript, it leverages browser speech synthesis and LLM integration to allow users to listen to articles, explain complex jargon, and interact with text selections through a seamless, floating interface.

Description

Reading long-form content can be exhausting. Talking AI provides a modern solution by transforming any webpage into an accessible audio experience. Whether you need a quick summary of a technical paragraph or want the entire page read aloud while you multi-task, this extension provides a high-performance, low-latency toolkit powered by AI. 🚀

Features

🔊 Text-to-Speech (TTS): Instant natural voice synthesis for any selected text or full webpage.
🧠 AI-Powered Explanations: Integration with OpenRouter to simplify complex topics into digestible language.
🖱️ Contextual Floating UI: An intuitive action button that appears exactly where you select text.
⚙️ Deep Customization: Adjust playback speed, select different voice profiles, and define custom AI system prompts.
🎨 Modern Design: A clean, accessible interface built with Tailwind CSS 4 and Radix UI primitives.
📄 PDF Support: Works seamlessly on both standard web pages and local PDF files opened in the browser.

Technologies Used

Category	Technology
Frontend Framework	React 19
Language	TypeScript
Styling	Tailwind CSS 4
UI Components	Radix UI
Build Tool	Vite
Icons	Lucide React
State/Routing	React Router

Getting Started

Installation

Follow these steps to set up the project for development or to load it into your browser:

Clone the Repository

git clone https://github.com/Ddilibe/talking-ai.git
cd talking-ai

Install Dependencies
```
npm install
```
Build the Project Since this is a Chrome Extension, it must be compiled into a build directory.
```
npm run build
```
Load into Chrome
- Open Chrome and navigate to chrome://extensions/.
- Enable Developer mode (toggle in the top right).
- Click Load unpacked.
- Select the build folder generated in your project directory.

Usage

Using the Floating Control

Navigate to any website.
Highlight a sentence or paragraph.
A small floating speaker icon will appear near your cursor.
Click the icon to hear the text read aloud immediately.

Using the Popup Menu

Click the extension icon in your browser toolbar to open the main dashboard.
Read Page Aloud: Scans the document and begins sequential playback.
Explain Selection: Uses the AI model configured in your settings to describe the highlighted text.
Settings: Click the gear icon to enter your API key and customize the AI's "personality" or playback speed.

Configuration

To enable AI explanations, you must provide an API key:

Open the extension Settings.
Paste your Open Router API Key.
Select your preferred model (e.g., GPT-4o or GPT-3.5-Turbo).
Save settings. Your key is stored locally and never leaves your browser environment.

Extension Architecture

Content Scripts: Injected into pages to handle DOM selection events and speech synthesis.
Popup UI: A React application providing the primary interface.
Message Passing: Uses chrome.runtime and chrome.tabs APIs to communicate between the UI and the active web page.

Author

Dilibe - GitHub Profile
Let's connect on LinkedIn or Twitter.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
public		public
src		src
styles		styles
.gitignore		.gitignore
NOTES.md		NOTES.md
README.md		README.md
components.json		components.json
eslint.config.js		eslint.config.js
index.html		index.html
package-lock.json		package-lock.json
package.json		package.json
popup.html		popup.html
tsconfig.app.json		tsconfig.app.json
tsconfig.json		tsconfig.json
tsconfig.node.json		tsconfig.node.json
vite.config.ts		vite.config.ts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Talking AI 🔊

Overview

Description

Features

Technologies Used