Learn OpenAI Whisper: Transform your understanding of GenAI through robust and accurate speech processing solutions 282229

Код товару: 282229Паперова книга
  • ISBN
    978-1835085929
  • Бренд
  • Автор
  • Рік
    2024
  • Мова
    Англійська
  • Ілюстрації
    Чорно-білі
Master automatic speech recognition (ASR) with groundbreaking generative AI for unrivaled accuracy and versatility in audio processing

Key Features
  • Uncover the intricate architecture and mechanics behind Whisper's robust speech recognition
  • Apply Whisper's technology in innovative projects, from audio transcription to voice synthesis
  • Navigate the practical use of Whisper in real-world scenarios for achieving dynamic tech solutions
Book Description
As the field of generative AI evolves, so does the demand for intelligent systems that can understand human speech. Navigating the complexities of automatic speech recognition (ASR) technology is a significant challenge for many professionals. This book offers a comprehensive solution that guides you through OpenAI's advanced ASR system.
You'll begin your journey with Whisper's foundational concepts, gradually progressing to its sophisticated functionalities. Next, you'll explore the transformer model, understand its multilingual capabilities, and grasp training techniques using weak supervision. The book helps you customize Whisper for different contexts and optimize its performance for specific needs. You'll also focus on the vast potential of Whisper in real-world scenarios, including its transcription services, voice-based search, and the ability to enhance customer engagement. Advanced chapters delve into voice synthesis and diarization while addressing ethical considerations.
By the end of this book, you'll have an understanding of ASR technology and have the skills to implement Whisper. Moreover, Python coding examples will equip you to apply ASR technologies in your projects as well as prepare you to tackle challenges and seize opportunities in the rapidly evolving world of voice recognition and processing.

What you will learn
  • Integrate Whisper into voice assistants and chatbots
  • Use Whisper for efficient, accurate transcription services
  • Understand Whisper's transformer model structure and nuances
  • Fine-tune Whisper for specific language requirements globally
  • Implement Whisper in real-time translation scenarios
  • Explore voice synthesis capabilities using Whisper's robust tech
  • Execute voice diarization with Whisper and NVIDIA's NeMo
  • Navigate ethical considerations in advanced voice technology
Who this book is for
Learn OpenAI Whisper is designed for a diverse audience, including AI engineers, tech professionals, and students. It's ideal for those with a basic understanding of machine learning and Python programming, and an interest in voice technology, from developers integrating ASR in applications to researchers exploring the cutting-edge possibilities in artificial intelligence.

About the Author
Josue Batista, a Digital Strategist and Solution Architect, has drawn on his rich blend of academic prowess and industry experience while writing this book. With an MBA and a Master's in Information Systems Management, he has excelled in various roles, including as a technical programmatic leader at PPG Industries, Meta's Reality Research Labs, and Harvard Business School, where he focused on generative AI and large language models. His book draws on his extensive experience in new technology introduction, offering readers a comprehensive guide to mastering OpenAI's Whisper for practical applications. Originally from Ciudad Bolivar, Venezuela, Josue resides in Pittsburgh, PA, with his wife and feline companions.
1'700 ₴
Купити
Monobank
до 10 платежей
от 191 ₴ / міс.
  • Нова Пошта
    Безкоштовно від 3'000,00 ₴
  • Укрпошта
    Безкоштовно від 1'000,00 ₴
  • Meest Пошта
    Безкоштовно від 3'000,00 ₴
Learn OpenAI Whisper: Transform your understanding of GenAI through robust and accurate speech processing solutions - фото 1
Інші книги Packt Publishing
ChatGPT for Cybersecurity Cookbook: Learn practical generative AI recipes to supercharge your cybersecurity skills
282453
Clint Bodungen
1'700 ₴
The Software Developer's Guide to Linux: A practical, no-nonsense guide to using the Linux command line and utilities as a software developer
289709
David CohenChristian Sturm
1'900 ₴
Ethical Hacking Workshop: Explore a practical approach to learning and applying ethical hacking techniques for effective cybersecurity
263204
Rishalin PillayMohammed Abutheraa
1'500 ₴
Exploring Deepfakes: Deploy powerful AI techniques for face replacement and more with this comprehensive guide
264115
Bryan LyonMatt Tora
1'900 ₴
Web Development Career Master Plan: Learn what it means to be a web developer and launch your journey toward a career in the industry
282242
Frank W. Zammetti
1'800 ₴
C++ Programming for Linux Systems: Create robust enterprise software for Linux and Unix-based operating systems 1st Edition
308351
Desislav AndreevStanimir Lukanov
2'100 ₴
Learning Spring Boot 3.0: Simplify the development of production-grade applications using Java and Spring, 3rd Editio
255176
Greg L. Turnquist
840 ₴
Python Data Cleaning Cookbook: Prepare your data for analysis with pandas, NumPy, Matplotlib, scikit-learn, and OpenAI 2nd ed. Edition
286419
Michael Walker
1'900 ₴
Go Programming - From Beginner to Professional - Second Edition: Learn everything you need to build modern software using Go 2nd ed. Edition
281117
Samantha Coyle
1'900 ₴

Характеристики

  • Бренд
  • Автор
  • Категорія
    Програмування
  • Рік
    2024
  • Сторінок
    372
  • Формат
    185х235 мм
  • Обкладинка
    М'яка
  • Тип паперу
    Офсетний
  • Мова
    Англійська
  • Ілюстрації
    Чорно-білі

Від видавця

Master automatic speech recognition (ASR) with groundbreaking generative AI for unrivaled accuracy and versatility in audio processing

Key Features
  • Uncover the intricate architecture and mechanics behind Whisper's robust speech recognition
  • Apply Whisper's technology in innovative projects, from audio transcription to voice synthesis
  • Navigate the practical use of Whisper in real-world scenarios for achieving dynamic tech solutions
Book Description
As the field of generative AI evolves, so does the demand for intelligent systems that can understand human speech. Navigating the complexities of automatic speech recognition (ASR) technology is a significant challenge for many professionals. This book offers a comprehensive solution that guides you through OpenAI's advanced ASR system.
You'll begin your journey with Whisper's foundational concepts, gradually progressing to its sophisticated functionalities. Next, you'll explore the transformer model, understand its multilingual capabilities, and grasp training techniques using weak supervision. The book helps you customize Whisper for different contexts and optimize its performance for specific needs. You'll also focus on the vast potential of Whisper in real-world scenarios, including its transcription services, voice-based search, and the ability to enhance customer engagement. Advanced chapters delve into voice synthesis and diarization while addressing ethical considerations.
By the end of this book, you'll have an understanding of ASR technology and have the skills to implement Whisper. Moreover, Python coding examples will equip you to apply ASR technologies in your projects as well as prepare you to tackle challenges and seize opportunities in the rapidly evolving world of voice recognition and processing.

What you will learn
  • Integrate Whisper into voice assistants and chatbots
  • Use Whisper for efficient, accurate transcription services
  • Understand Whisper's transformer model structure and nuances
  • Fine-tune Whisper for specific language requirements globally
  • Implement Whisper in real-time translation scenarios
  • Explore voice synthesis capabilities using Whisper's robust tech
  • Execute voice diarization with Whisper and NVIDIA's NeMo
  • Navigate ethical considerations in advanced voice technology
Who this book is for
Learn OpenAI Whisper is designed for a diverse audience, including AI engineers, tech professionals, and students. It's ideal for those with a basic understanding of machine learning and Python programming, and an interest in voice technology, from developers integrating ASR in applications to researchers exploring the cutting-edge possibilities in artificial intelligence.

About the Author
Josue Batista, a Digital Strategist and Solution Architect, has drawn on his rich blend of academic prowess and industry experience while writing this book. With an MBA and a Master's in Information Systems Management, he has excelled in various roles, including as a technical programmatic leader at PPG Industries, Meta's Reality Research Labs, and Harvard Business School, where he focused on generative AI and large language models. His book draws on his extensive experience in new technology introduction, offering readers a comprehensive guide to mastering OpenAI's Whisper for practical applications. Originally from Ciudad Bolivar, Venezuela, Josue resides in Pittsburgh, PA, with his wife and feline companions.

Відгуки про Learn OpenAI Whisper: Transform your understanding of GenAI through robust and accurate speech processing solutions

Learn OpenAI Whisper: Transform your understanding of GenAI through robust and accurate speech processing solutions
Learn OpenAI Whisper: Transform your understanding of GenAI through robust and accurate speech processing solutions
1'700 ₴
Купити
Персонально для вас
Essential GraphRAG: Knowledge Graph-Enhanced RAG
310262
Tomaz BratanicOskar Hane
1'600 ₴
Designing Deep Learning Systems: A software engineer's guide
246935
Chi WangDonald Szeto
1'650 ₴
Practical AI for Healthcare Professionals. Machine Learning with Numpy, Scikit-learn, and TensorFlow. 1st Ed.
244715
Abhinav Suri
1'700 ₴
Architecting Data and Machine Learning Platforms: Enable Analytics and AI-Driven Innovation in the Cloud 1st Edition
274432
Marco TranquillinFirat TekinerValliappa Lakshmanan
1'700 ₴
Beyond the Algorithm: AI, Security, Privacy, and Ethics 1st Edition
277686
Omar SantosPetar Radanliev
1'700 ₴
Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning 1st Edition
280743
Margaux Masson-Forsythe
1'700 ₴
MLOps with Ray: Best Practices and Strategies for Adopting Machine Learning Operations First Edition
281515
Hien LuuZhe ZhangMax Pumperla
1'700 ₴
ChatGPT for Cybersecurity Cookbook: Learn practical generative AI recipes to supercharge your cybersecurity skills
282453
Clint Bodungen
1'700 ₴
Generative AI Foundations in Python: Discover key techniques and navigate modern challenges in LLMs
295065
Carlos RodriguezSamira Shaikh
1'700 ₴
Python Machine Learning By Example: Unlock machine learning best practices with real-world use cases 4th Edition
299766
Yuxi (Hayden) Liu
1'700 ₴
How Large Language Models Work
308167
Edward RaffDrew FarrisStella Biderman
1'700 ₴
Analytical Skills for AI and Data Science: Building Skills for an AI-Driven Enterprise
153335
Daniel Vaughan
1'800 ₴
Artificial Neural Networks with Java. 2nd Ed.
244661
Igor Livshin
1'800 ₴
Hands-on Machine Learning with Python. Implement Neural Network Solutions with Scikit-learn and PyTorch. 1st Ed.
244683
Ashwin Pajankar, Aditya Joshi
1'800 ₴
Fundamentals of Analytics Engineering: An introduction to building end-to-end analytics solutions
278869
Dumky de WildeFanny KassapianJovan Gligorevic
1'600 ₴
WebAssembly: The Definitive Guide: Safe, Fast, and Portable Code. 1st Ed.
244800
Brian Sletten
2'000 ₴
MLOps with Ray: Best Practices and Strategies for Adopting Machine Learning Operations First Edition
281515
Hien LuuZhe ZhangMax Pumperla
1'700 ₴
Becoming SRE: First Steps Toward Reliability for You and Your Organization 1st Edition
275779
David N. Blank-Edelman
1'700 ₴
Effective Conversational AI: Chatbots that work
305302
Andrew FreedEniko RozsaCari Jacobs
2'400 ₴
Beyond the Algorithm: AI, Security, Privacy, and Ethics 1st Edition
277686
Omar SantosPetar Radanliev
1'700 ₴
AI for Everyday IT: Accelerate workplace productivity
308363
Chrissy LeMaireBrandon Abshire
2'300 ₴
Alex Katz Catalogue Raisonn. Prints 1947-2022
296517
Klaus Albrecht SchroderMarietta Mautner MarkhofGunhild Bauer
9'390 ₴