{% extends "simplebase.html" %} {% block content %}

Using Kindle/Audible as data sources

Audible and Kindle are a good source of the paired audio and text data needed for voice cloning.
To build a data source, first purchase a matching Audible audiobook and Kindle book (Amazon typically refers to this pairing as Whispersync).

Audible to audio

To convert an Audible audiobook to an audio file (.mp3) I use AaxAudioConverter. Here are the steps to install and use it:

  1. Download the Windows Audible app from the Microsoft Store
  2. Open the Audible app and download the audiobook you wish to convert
  3. Download AaxAudioConverter
  4. Run the application and click "Add" in the top left
  5. Select the audiobook you downloaded in the Audible app
  6. Select "MP3" as the export format and "Single file per AAX file", then click convert
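
Once the conversion has finished, it can help to confirm that the MP3 actually landed where you expect before moving on. The snippet below is only a minimal sketch of such a check: the function name `find_mp3_files` and the idea of pointing it at an output folder are assumptions for illustration, not part of AaxAudioConverter or this project.

```python
from pathlib import Path


def find_mp3_files(folder):
    """Return (path, size in MB) for every .mp3 file directly inside `folder`.

    Hypothetical helper: `folder` is assumed to be whichever output
    directory you chose in the converter.
    """
    results = []
    for path in sorted(Path(folder).iterdir()):
        # Match the extension case-insensitively (.mp3, .MP3, ...)
        if path.is_file() and path.suffix.lower() == ".mp3":
            size_mb = path.stat().st_size / (1024 * 1024)
            results.append((path, round(size_mb, 2)))
    return results
```

Calling `find_mp3_files` on the output folder should list one entry per converted audiobook. An empty list or a very small file size suggests the conversion did not complete.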

Kindle to text

To convert a Kindle book to a text file (.txt) I developed a Chrome extension, found in the dataset folder of this project. Here are the steps to install and use it:

  1. Go to chrome://extensions/ in Chrome
  2. Click on "Load unpacked" in the top left (enable "Developer mode" first if the button is not visible) and select the "extension" folder in the dataset directory
  3. Open the first page of the book you want to scrape in the Amazon cloud reader
  4. Open the extension and click "Scrape book". Wait for the reader to scan through all the pages
  5. When the scanner has reached the last page, press "Export book". This will download the text file needed for step 1
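
Before using the exported text, a quick sanity check can catch an incomplete scrape. The following is a minimal sketch, assuming the export is a plain UTF-8 text file. The helper names `clean_book_text` and `word_count` are hypothetical and are not part of the extension.

```python
import re


def clean_book_text(raw):
    """Collapse runs of whitespace and drop empty lines from scraped text."""
    lines = []
    for line in raw.splitlines():
        # Replace tabs and repeated spaces with a single space
        line = re.sub(r"\s+", " ", line).strip()
        if line:
            lines.append(line)
    return "\n".join(lines)


def word_count(text):
    """Count whitespace-separated words; a rough check of completeness."""
    return len(text.split())
```

Reading the exported file, passing its contents through `clean_book_text`, and comparing `word_count` against the rough length of the book gives an early signal if pages were missed during scanning.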
{% endblock %}