Using AWS Polly And IBM Watson Text-To-Speech And Tone Analyser Artificial Intelligence Services To Read and Analyse Clinical Chat Data (Part 2)

Note: Part one to this series can be found HERE

In my last blog post I outlined the concept of creating a simple Python GUI application which utilised Amazon Polly Text-To-Speech cloud API. The premise was quite simple – retrieve chat data stored in SQL Server database and pass it to Polly API to convert it into audible stream using a choice of different male and female voices.

Whilst this functionality provided a good ‘playground’ to showcase one of the multitude of cloud-enabled machine learning applications, I felt that augmenting text-to-voice feature with some visual clues as a representation of the chat content would provide additional value. This is where I thought pairing text-to-speech with linguistic analysis can make this app even more useful and complete. As of today, all major cloud juggernauts offer a plethora general-purpose ML services but when it comes to linguistic analysis which goes beyond sentiment tagging, IBM has risen to become a major player in this arena. IBM Watson Tone Analyser specifically targets understanding emotions and communication style using linguistic analysis to detect emotional, social and language tones in written text. Tones detected within the ‘General Purpose Endpoint’ include joy, fear, sadness, anger, disgust, analytical, confident, tentative, openness, conscientiousness, extraversion, agreeableness, and emotional range. Typical use cases for this service include analysing emotions and tones in what people write online, like tweets or reviews, predicting whether they are happy, sad, confident as well as monitoring customer service and support conversations, personalised marketing and of course chat bots. The following diagram shows the basic flow of calls to the service.

You authenticate to the Tone Analyzer API by providing the username and password that are provided in the service credentials for the service instance that you want to use. The API uses HTTP basic authentication. The request includes several parameters and their respective value options and the simplest way to kick some tires (after completing the sigh-up process) is to use curl tool command and some sample text for analysis e.g.

curl -v -u "username":"password" -H "Content-Type: text/plain" -d "I feel very happy today!"
"https://gateway.watsonplatform.net/tone-analyzer/api/v3/tone?version=2016-05-19"
{
   "document_tone": {
      "tone_categories": [
         {
            "tones": [
               {
                  "score": 0.013453,
                  "tone_id": "anger",
                  "tone_name": "Anger"
               },
               {
                  "score": 0.017433,
                  "tone_id": "disgust",
                  "tone_name": "Disgust"
               },
               {
                  "score": 0.039234,
                  "tone_id": "fear",
                  "tone_name": "Fear"
               },
               {
                  "score": 0.857981,
                  "tone_id": "joy",
                  "tone_name": "Joy"
               },
               {
                  "score": 0.062022,
                  "tone_id": "sadness",
                  "tone_name": "Sadness"
               }
            ],
            "category_id": "emotion_tone",
            "category_name": "Emotion Tone"
         },
         {
            "tones": [
               {
                  "score": 0,
                  "tone_id": "analytical",
                  "tone_name": "Analytical"
               },
               {
                  "score": "0.849827",
                  "tone_id": "confident",
                  "tone_name": "Confident"
               },
               {
                  "score": 0,
                  "tone_id": "tentative",
                  "tone_name": "Tentative"
               }
            ],
            "category_id": "language_tone",
            "category_name": "Language Tone"
         },
         {
            "tones": [
               {
                  "score": 0.016275,
                  "tone_id": "openness_big5",
                  "tone_name": "Openness"
               },
               {
                  "score": 0.262399,
                  "tone_id": "conscientiousness_big5",
                  "tone_name": "Conscientiousness"
               },
               {
                  "score": 0.435574,
                  "tone_id": "extraversion_big5",
                  "tone_name": "Extraversion"
               },
               {
                  "score": 0.679046,
                  "tone_id": "agreeableness_big5",
                  "tone_name": "Agreeableness"
               },
               {
                  "score": 0.092516,
                  "tone_id": "emotional_range_big5",
                  "tone_name": "Emotional Range"
               }
            ],
            "category_id": "social_tone",
            "category_name": "Social Tone"
         }
      ]
   }
}

The service returns JSON structure which can be further unpacked and analysed/visualised. Using their SDK and a little bit of Python we can create a little script that will pass the desired text to the Tone Analyser API and return a matplotlib graph chart visualising each tone value within its respective category. Below is a simple visualisation of a paragraph containing text with linguistically-negative sentiment and the Python code generating it.

import matplotlib.pyplot as plt
import numpy as np
import matplotlib as mpl
import watson_developer_cloud as wdc

tone_analyzer = wdc.ToneAnalyzerV3(
  version='2016-05-19',
  username='username',
  password='password',
  x_watson_learning_opt_out=True
)

message = 'Hi Team, I know the times are difficult! \
Our sales have been disappointing for the \
past three quarters for our data analytics \
product suite. We have a competitive data \
analytics product suite in the industry. \
But we need to do our job selling it!'

tone=tone_analyzer.tone(message, sentences=False, content_type='text/plain')

#assign each tone name and value to its respective category 
emotion_tone={}
language_tone={}
social_tone={}

for cat in tone['document_tone']['tone_categories']:
    print('Category:', cat['category_name'])
    if cat['category_name'] == 'Emotion Tone':
        for tone in cat['tones']:
            print('-', tone['tone_name'], tone['score'])
            emotion_tone.update({tone['tone_name']:tone['score']})     
    if cat['category_name'] == 'Social Tone':
        for tone in cat['tones']:
            print('-', tone['tone_name'], tone['score'])
            social_tone.update({tone['tone_name']:tone['score']}) 
    if cat['category_name'] == 'Language Tone':
        for tone in cat['tones']:
            print('-', tone['tone_name'], tone['score'])
            language_tone.update({tone['tone_name']:tone['score']})             


#find largest value in all tones to adjust the x scale accordingly
max_tone_value = {**emotion_tone, **language_tone, **social_tone}
if max(max_tone_value.values()) > 0.9:
    max_tone_value = 1
else:
    max_tone_value = max(max_tone_value.values())+0.1


#plot all tones by category
fig = plt.figure(figsize=(7,7))
mpl.style.use('seaborn')
fig.suptitle('Tones by Intensity, scale range: 0(min) - 1(max)', fontsize=14, fontweight='bold')

x1=fig.add_subplot(311)
y_pos = np.arange(len(emotion_tone.keys()))
plt.barh(y_pos, emotion_tone.values(), align='center', alpha=0.6, color='limegreen')
plt.yticks(y_pos, emotion_tone.keys())
plt.title('Emotion Tone', fontsize=12)
x1.set_xlim([0, max_tone_value])

x2=fig.add_subplot(312)
y_pos = np.arange(len(social_tone.keys()))
plt.barh(y_pos, social_tone.values(), align='center', alpha=0.6,color='red')
plt.yticks(y_pos, social_tone.keys())
plt.title('Social Tone',fontsize=12)
x2.set_xlim([0, max_tone_value])

x3=fig.add_subplot(313)
y_pos = np.arange(len(language_tone.keys()))
plt.barh(y_pos, language_tone.values(), height = 0.4, align='center', alpha=0.6, color='deepskyblue')
plt.yticks(y_pos, language_tone.keys())
plt.title('Language Tone',fontsize=12)
x3.set_xlim([0, max_tone_value])

plt.tight_layout(pad=0.9, w_pad=0.5, h_pad=1.7)
fig.subplots_adjust(top=0.85, left=0.20)
plt.show()

And finally, the amended Python code for the complete application (including AWS Polly integration from Part 1) is as follows:

import sys
import time
import io
from contextlib import closing
import multiprocessing
import pygame
import numpy as np
import pyodbc
import boto3
import watson_developer_cloud as wdc
import tkinter as tk
from tkinter import scrolledtext, ttk, messagebox
import matplotlib as mpl
import matplotlib.pyplot as plt
from matplotlib.backends.backend_tkagg import FigureCanvasTkAgg


class ConnectionInfo:
    def __init__(self):
        self.use_win_auth = tk.IntVar()
        self.inst_srv = tk.StringVar()
        self.inst_db = tk.StringVar()
        self.inst_login = tk.StringVar()
        self.inst_passwd = tk.StringVar()
        self.session_id = tk.IntVar()
        self.use_aws_api = tk.IntVar(value=1)
        self.aws_access_key_id = tk.StringVar()
        self.aws_secret_access_key = tk.StringVar()
        self.use_ibm_api = tk.IntVar(value=1)
        self.ibm_username = tk.StringVar()
        self.ibm_passwd = tk.StringVar()
        self.clinician_voice = tk.StringVar()
        self.patient_voice = tk.StringVar()

        self.ibm_version = '2016-05-19'
        self.ibm_x_watson_learning_opt_out = True


class MsSqlDatabase:
    ODBC_DRIVER = '{ODBC Driver 13 for SQL Server}'

    def __init__(self, conn_info):
        self.conn_info = conn_info

    def connect(self):
        connection_string = ('DRIVER={driver};SERVER={server};DATABASE={db};'.format(
            driver=self.ODBC_DRIVER,
            server=self.conn_info.inst_srv.get(),
            db=self.conn_info.inst_db.get()))
        if self.conn_info.use_win_auth.get() == 1:
            connection_string = connection_string + 'Trusted_Connection=yes;'
        else:
            connection_string = connection_string + 'UID={uid};PWD={password};'.format(
                uid=self.conn_info.inst_login.get(),
                password=self.conn_info.inst_passwd.get())

        try:
            conn = pyodbc.connect(connection_string, timeout=1)
        except pyodbc.Error as err:
            conn = None
        return conn

    def get_session(self, conn):
        try:
            cursor = conn.cursor()
            cursor.execute(
                """SELECT UPPER(user_role), message_body FROM dbo.test_dialog t
                WHERE t.Session_ID = ? ORDER BY t.ID ASC""", self.conn_info.session_id.get())
            results = cursor.fetchall()
        except pyodbc.Error as err:
            results = None
        return results

    def get_user_id(self, conn):
        try:
            cursor = conn.cursor()
            cursor.execute(
                """SELECT DISTINCT user_id from dbo.test_dialog t
                WHERE t.session_id = ? AND user_role = 'client'""", self.conn_info.session_id.get())
            results = cursor.fetchall()
        except pyodbc.Error as err:
            results = None
        return results

    def get_messages(self, conn):
        try:
            cursor = conn.cursor()
            cursor.execute(
                """SELECT t.user_role, t.direction, LTRIM(RTRIM(f.RESULT)) AS message FROM dbo.test_dialog t
                CROSS APPLY dbo.tvf_getConversations (t.message_body, 50, '.') f WHERE t.session_id = ?
                ORDER BY t.id, f.id""", self.conn_info.session_id.get())
            results = cursor.fetchall()
        except pyodbc.Error as err:
            results = None
        return results

    def get_messages_for_tone_analyse(self, conn):
        try:
            cursor = conn.cursor()
            cursor.execute(
                """DECLARE @message VARCHAR(MAX) 
                SELECT @message = COALESCE(@message + ' ', '') + message_body 
                FROM dbo.test_dialog t WHERE t.Session_ID = ? AND user_role = 'client' ORDER BY t.ID ASC
                SELECT @message""", self.conn_info.session_id.get())
            results = cursor.fetchall()
            results = [row[0] for row in results]
            try:
                messages = ''.join(results)
            except TypeError:
                messages = None
        except pyodbc.Error as err:
            messages = None
        return messages


class AudioPlayer:
    def __init__(self, credentials, voices):
        self.credentials = credentials
        self.voices = voices

    def run(self, messages, voices, commands, status):
        status['code'] = 0
        status['message'] = 'OK'

        try:
            polly_service = boto3.client(
                'polly',
                aws_access_key_id = self.credentials['aws_access_key'],
                aws_secret_access_key = self.credentials['aws_secret_key'],
                region_name = 'eu-west-1')
        except:
            polly_service = None

        if not polly_service:
            status['code'] = 1
            status['message'] = 'Cannot connect to AWS Polly service. Please check your API credentials are valid.'
            return

        is_stopped = False
        is_paused = False
        pygame.mixer.init(channels=1, frequency=44100)
        for message in messages:
            print(message)

            try:
                polly_response = polly_service.synthesize_speech(
                    OutputFormat='ogg_vorbis',
                    Text=message[2],
                    TextType='text',
                    VoiceId=voices[message[0]])
            except:
                polly_response = None

            if not polly_response:
                status['code'] = 2
                status['message'] = 'Cannot connect to AWS Polly service. Please check your API credentials are valid.'
                break

            if "AudioStream" in polly_response:
                with closing(polly_response["AudioStream"]) as stream:
                    data = stream.read()
                    filelike = io.BytesIO(data)
                    sound = pygame.mixer.Sound(file=filelike)
                    sound.play()

                    while pygame.mixer.get_busy() or is_paused:
                        if not commands.empty():
                            command = commands.get()
                            if command == 'STOP':
                                sound.stop()
                                is_stopped = True
                                break
                            if command == 'PAUSE':
                                is_paused = not is_paused
                                if is_paused:
                                    sound.stop()
                                else:
                                    sound.play()
                        time.sleep(0.010)
            if is_stopped:
                break


class AppFrame(object):
    def __init__(self):
        self.root = tk.Tk()
        self.root.title('Polly Text-To-Speech GUI Prototype ver 1.1')
        self.root.resizable(width=False, height=False)

        self.conn_info = ConnectionInfo()

        self.menubar = self.create_menubar()
        self.connection_details_frame = ConnDetailsFrame(self.root, self)
        self.session_frame = SessionDetailsFrame(self.root, self)
        self.playback_frame = PlaybackDetailsFrame(self.root, self)
        self.graph_frame = WatsonGraphDetailsFrame(self.root, self)

    def create_menubar(self):
        menubar = tk.Menu(self.root)

        title_menu = tk.Menu(menubar, tearoff=0)
        title_menu.add_command(label='API details...', command=self.on_api_details_select)
        title_menu.add_command(label='About...', command=self.on_about_select)
        menubar.add_cascade(label='About', menu=title_menu)
        self.root.config(menu=menubar)

        return menubar

    def on_api_details_select(self):
        dialog = APIDetailsDialog(self.root)
        self.root.wait_window(dialog)

    def on_about_select(self):
        tk.messagebox.showinfo(title="About", message="Polly Text-To-Speech GUI Prototype ver 1.1")

    def run(self):
        self.root.mainloop()


class ConnDetailsFrame(ttk.LabelFrame):

    def __init__(self, root, parent):
        super(ConnDetailsFrame, self).__init__(root, text='1. Connection Details')
        super(ConnDetailsFrame, self).grid(
            row=0, column=0, columnspan=3, sticky='W',
            padx=5, pady=5, ipadx=5, ipady=5
        )

        self.root = root
        self.parent = parent
        self.conn_info = parent.conn_info

        self.create_notebook()

    def create_notebook(self):
        self.tab_control = ttk.Notebook(self)
        self.create_frames()
        self.create_labels()
        self.create_entry()
        self.create_checkbuttons()

    def create_frames(self):
        self.tab_db = ttk.Frame(self.tab_control)
        self.tab_api = ttk.Frame(self.tab_control)
        self.tab_control.add(self.tab_db, text="Database Connection Details ")
        self.tab_control.add(self.tab_api, text="APIs Connection Details ")
        self.tab_control.grid(row=0, column=0, sticky='E', padx=5, pady=5)

    def create_labels(self):
        ttk.Label(self.tab_db, text="Server/Instance Name:").grid(row=0, column=0, sticky='E', padx=5, pady=(15, 5))
        ttk.Label(self.tab_db, text="Database Name:").grid(row=1, column=0, sticky='E', padx=5, pady=5)
        ttk.Label(self.tab_db, text="User Name:").grid(column=0, row=3, sticky="E", padx=5, pady=5)
        ttk.Label(self.tab_db, text="Password:").grid(column=0, row=4, sticky="E", padx=5, pady=(5, 10))        

        ttk.Label(self.tab_api, text="AWS Access Key ID:").grid(column=0, row=1, sticky="E", padx=5, pady=(5, 5))
        ttk.Label(self.tab_api, text="AWS Secret Access Key:").grid(column=0, row=2, sticky="E", padx=5, pady=5)
        ttk.Label(self.tab_api, text="IBM Watson Username:").grid(column=0, row=4, sticky="E", padx=5, pady=(5, 5))
        ttk.Label(self.tab_api, text="IBM Watson Password:").grid(column=0, row=5, sticky="E", padx=5, pady=(5,15))
    
    def create_checkbuttons(self):
        check_use_win_auth = ttk.Checkbutton(self.tab_db, onvalue=1, offvalue=0,
                                             variable=self.conn_info.use_win_auth,
                                             text='Use Windows Authentication',
                                             command=self.on_use_win_auth_change)
        check_use_win_auth.grid(row=2, column=0, sticky='W', padx=15, pady=(15,5))  
        check_use_aws_api = ttk.Checkbutton(self.tab_api, onvalue=1, offvalue=0,
                                            variable=self.conn_info.use_aws_api, text='Use AWS Text-To-Speech API')
        check_use_aws_api.grid(row=0, column=0, sticky='W', padx=15, pady=(15,5)) 
        check_use_ibm_api = ttk.Checkbutton(self.tab_api, onvalue=1, offvalue=0,
                                            variable=self.conn_info.use_ibm_api,
                                            text='Use IBM Watson API',
                                            command=self.on_use_ibm_api_change)
        check_use_ibm_api.grid(row=3, column=0, sticky='W', padx=15, pady=(15,5))   

    def create_entry(self):
        entry_db_server_name = ttk.Entry(self.tab_db, width=60, textvariable=self.conn_info.inst_srv)
        entry_db_server_name.grid(row=0, column=1, sticky='W', padx=10, pady=(15, 5))
        entry_db_name = ttk.Entry(self.tab_db, width=60, textvariable=self.conn_info.inst_db)
        entry_db_name.grid(row=1, column=1, sticky='W', padx=10, pady=5)                                      
        self.entry_db_user_name = ttk.Entry(self.tab_db, width=60, textvariable=self.conn_info.inst_login)
        self.entry_db_user_name.grid(row=3, column=1, padx=10, pady=5)
        self.entry_db_password = ttk.Entry(self.tab_db, width=60, textvariable=self.conn_info.inst_passwd, show="*")
        self.entry_db_password.grid(row=4, column=1, padx=10, pady=(5, 10))

        entry_aws_access_key = ttk.Entry(self.tab_api, width=60,
                                         textvariable=self.conn_info.aws_access_key_id)
        entry_aws_access_key.grid(row=1, column=1, sticky='W', padx=10, pady=(5, 5))
        entry_aws_secret_key = ttk.Entry(self.tab_api, width=60,
                                         textvariable=self.conn_info.aws_secret_access_key)
        entry_aws_secret_key.grid(row=2, column=1, padx=5, pady=5)         
        self.entry_ibm_username = ttk.Entry(self.tab_api, width=60,
                                            textvariable=self.conn_info.ibm_username)
        self.entry_ibm_username.grid(row=4, column=1, padx=5, pady=5)
        self.entry_ibm_password = ttk.Entry(self.tab_api, width=60,
                                            textvariable=self.conn_info.ibm_passwd,show="*")
        self.entry_ibm_password.grid(row=5, column=1, padx=5, pady=(5,15))

    def on_use_win_auth_change(self):
        if (self.conn_info.use_win_auth.get() == 1):
            self.entry_db_user_name.configure(state='disabled')
            self.entry_db_password.configure(state='disabled')
        else:
            self.entry_db_user_name.configure(state='normal')
            self.entry_db_password.configure(state='normal')

    def on_use_ibm_api_change(self):
        if (self.conn_info.use_ibm_api.get() == 0):
            self.entry_ibm_username.configure(state='disabled')
            self.entry_ibm_password.configure(state='disabled')
        else:
            self.entry_ibm_username.configure(state='normal')
            self.entry_ibm_password.configure(state='normal')


class SessionDetailsFrame(ttk.LabelFrame):
    def __init__(self, root, parent):
        super(SessionDetailsFrame, self).__init__(root, text='2. Session Details')
        super(SessionDetailsFrame, self).grid(row=1, column=0, sticky='NW', padx=5, pady=5, ipadx=5, ipady=5, rowspan=2)

        self.parent = parent
        self.conn_info = parent.conn_info

        self.create_entries()
        self.create_buttons()
        self.create_scrolled_text()

    def create_entries(self):
        ttk.Entry(
            self, justify="center", width=18, font="Helvetica 18 bold",
            textvariable=self.conn_info.session_id).grid(row=1, column=2, padx=3, pady=5, sticky='W')

    def create_buttons(self):
        search_session_btn = ttk.Button(self, text="SEARCH SESSION ID", command=self.on_search_session_click)
        search_session_btn.grid(row=1, column=3, ipadx=8, ipady=6)

    def create_scrolled_text(self):
        self.dialog_st = scrolledtext.ScrolledText(self, width=45, height=13, wrap=tk.WORD)
        self.dialog_st.grid(column=2, row=2, padx=4, pady=4, columnspan=2, sticky='w')

        style = ttk.Style()
        style.configure("TButton", foreground="red")

    def on_search_session_click(self):
        db = MsSqlDatabase(self.conn_info)
        conn = db.connect()
        if conn:
            results = db.get_session(conn)
            if results:
                self.dialog_st.delete('1.0', tk.END)
                for role, message in results:
                    self.dialog_st.insert(tk.END, '{}:\n'.format(role), 'role')
                    self.dialog_st.insert(tk.END, '{}\n\n'.format(message), 'message')
                    self.dialog_st.tag_config('role', foreground='red', font="Courier 11 bold")
            else:
                tk.messagebox.showwarning(title="Warning", message="Nominated Session ID not found in the database!")
        else:
            tk.messagebox.showwarning(title="Warning", message="Cannot connect to database server!")


class PlaybackDetailsFrame(ttk.LabelFrame):
    def __init__(self, root, parent):
        super(PlaybackDetailsFrame, self).__init__(root, text='3. Playback Details')
        super(PlaybackDetailsFrame, self).grid(row=1, column=1, sticky='WN', padx=5, pady=5, ipadx=5, ipady=5)

        self.root = root
        self.parent = parent
        self.conn_info = parent.conn_info

        self.create_labels()
        self.create_combobox()
        self.create_buttons()

        root.protocol('WM_DELETE_WINDOW', self.on_closing)

        self.process_manager = multiprocessing.Manager()
        self.player_process = None
        self.player_commands = None
        self.player_status = None

    def create_labels(self):
        l1 = ttk.Label(self, text="Clinician Voice:").grid(row=0, column=0, sticky='W', padx=5, pady=5)
        l2 = ttk.Label(self, text="Patient Voice:").grid(row=0, column=1, sticky='W', padx=5, pady=5)
        var1 = tk.StringVar(self.root)
        var2 = tk.StringVar(self.root)

    def create_combobox(self):
        clinician = ttk.Combobox(self, width=11, textvariable=self.conn_info.clinician_voice)
        clinician.grid(row=1, column=0, padx=5, pady=5, sticky='NW')
        clinician['values'] = (
            'Russell',
            'Nicole',
            'Amy',
            'Brian',
            'Emma',
            'Raveena',
            'Ivy',
            'Joanna',
            'Joey',
            'Justin',
            'Kendra',
            'Kimberly',
            'Salli'
        )
        clinician.current(0)

        patient = ttk.Combobox(self, width=11, textvariable=self.conn_info.patient_voice)
        patient.grid(row=1, column=1, padx=(5, 0), pady=5, sticky='NW')
        patient['values'] = (
            'Nicole',
            'Russell',
            'Amy',
            'Brian',
            'Emma',
            'Raveena',
            'Ivy',
            'Joanna',
            'Joey',
            'Justin',
            'Kendra',
            'Kimberly',
            'Salli')
        patient.current(0)

    def create_buttons(self):
        play_session_btn = ttk.Button(self, text="PLAY", width=25, command=self.on_play_session_click)
        play_session_btn.grid(row=2, column=0, columnspan=2, padx=(10, 2), pady=(20, 5), sticky='WE')
        pause_session_btn = ttk.Button(self, text="PAUSE", width=25, command=self.on_pause_session_click)
        pause_session_btn.grid(row=3, column=0, columnspan=2, padx=(10, 2), pady=5, sticky='WE')
        stop_session_btn = ttk.Button(self, text="STOP", width=25, command=self.on_stop_session_click)
        stop_session_btn.grid(row=4, column=0, columnspan=2, padx=(10, 2), pady=(5, 5), sticky='WE')

    def on_play_session_click(self):
        if self.player_process:
            if self.player_process.is_alive():
                self.player_commands.put('STOP')

        db = MsSqlDatabase(self.conn_info)
        db_conn = db.connect()
        if db_conn:
            messages = db.get_messages(db_conn)
            if messages:
                is_credentials_valid = True
                if len(self.conn_info.aws_access_key_id.get()) == 0 or \
                    len(self.conn_info.aws_secret_access_key.get()) == 0:
                        is_credentials_valid = False

                if (is_credentials_valid):
                    credentials = {
                        'aws_access_key': self.conn_info.aws_access_key_id.get(),
                        'aws_secret_key': self.conn_info.aws_secret_access_key.get()
                    }
                    voices = {
                        'clinician': self.conn_info.clinician_voice.get(),
                        'client': self.conn_info.patient_voice.get()
                    }
                    player = AudioPlayer(credentials, voices)

                    self.player_commands = self.process_manager.Queue()
                    self.player_status = self.process_manager.dict()
                    self.player_process = multiprocessing.Process(
                        target=player.run,
                        args=(messages, voices, self.player_commands, self.player_status))
                    self.player_process.start()
                    self.root.after(500, lambda: self.check_player_status(self.player_process, self.player_status))
                else:
                    tk.messagebox.showwarning(title="Warning", message="AWS access or secret key is empty")
            else:
                tk.messagebox.showwarning(title="Warning", message="Nominated Session ID not found in the database!")
        else:
            tk.messagebox.showwarning(title="Warning", message="Cannot connect to database server")

    def on_pause_session_click(self):
        if self.player_commands:
            self.player_commands.put("PAUSE")

    def on_stop_session_click(self):
        if self.player_commands:
            self.player_commands.put("STOP")

    def on_closing(self):
        if self.player_process:
            if self.player_process.is_alive():
                self.player_commands.put('STOP')
            self.player_process.join()

        self.root.destroy()

    def check_player_status(self, player_process, player_status):
        if not player_process.is_alive():
            print('Player status: {}, {}'.format(player_status['code'], player_status['message']))
            if player_status['code'] != 0:
                tk.messagebox.showwarning(title="Warning", message=player_status['message'])
        else:
            self.root.after(500, lambda: self.check_player_status(player_process, player_status))


class WatsonGraphDetailsFrame(ttk.LabelFrame):
    def __init__(self, root, parent):
        super(WatsonGraphDetailsFrame, self).__init__(root, text='4. Analysis Graph Details')
        super(WatsonGraphDetailsFrame, self).grid(row=2, column=1, sticky='WE', padx=5, pady=5, ipadx=5, ipady=1)

        self.root = root
        self.parent = parent
        self.conn_info = parent.conn_info

        self.create_buttons()

    def create_buttons(self):
        self.tone_analysis_btn = ttk.Button(self, text='PERFORM TONE ANALYSIS', width=28,
                                                      command=self.tone_analysis_btn_click)
        self.tone_analysis_btn.grid(row=0, column=1, padx=(12, 2), pady=(11, 11), sticky='EW')

    def tone_analysis_btn_click(self):
        if len(self.conn_info.ibm_username.get()) == 0 or len(self.conn_info.ibm_passwd.get()) == 0 \
            or self.conn_info.use_ibm_api.get() == 0:
                tk.messagebox.showwarning(title='Warning',
                                          message='\'IBM Watson API\' username or password is empty or disabled')
                return

        db = MsSqlDatabase(self.conn_info)
        conn = db.connect()
        if not conn:
            tk.messagebox.showwarning(title='Warning', message='Cannot connect to database server!')
            return

        messages = db.get_messages_for_tone_analyse(conn)
        if not messages:
            tk.messagebox.showwarning(title='Warning', message='Nominated Session ID not found in the database!')
            return

        if len(messages.split()) < 3: tk.messagebox.showwarning(title='Warning', message='Too few words provided!') return if sys.getsizeof(messages) > 128000:
            tk.messagebox.showwarning(title='Warning', message='The message provided is too long for API string limit.')
            return

        db_user_id = db.get_user_id(conn)
        if not db_user_id:
            tk.messagebox.showwarning(title='Warning', message='Cannot get User ID for given Session ID')
            return
        client = { 'session_id': self.conn_info.session_id.get(),
                   'user_id': db_user_id[0][0] }

        tone_analyzer = wdc.ToneAnalyzerV3(
            version=self.conn_info.ibm_version,
            username=self.conn_info.ibm_username.get(),
            password=self.conn_info.ibm_passwd.get(),
            x_watson_learning_opt_out=self.conn_info.ibm_x_watson_learning_opt_out
        )

        try:
            tone = tone_analyzer.tone(messages, sentences=False, content_type='text/plain')
        except:
            tk.messagebox.showwarning(title='Warning', message='Cannot connect to IBM Watson service')
            return

        dialog = ToneAnalysisDialog(self, client, tone)
        self.wait_window(dialog)


class APIDetailsDialog(tk.Toplevel):
    def __init__(self, parent):
        super(APIDetailsDialog, self).__init__(parent)
        self.parent = parent

        self.title('API Details')
        self.resizable(width=False, height=False)

        frame = ttk.LabelFrame(self, text="Polly Text-To-Speech GUI Prototype API Details")
        ttk.Label(frame, text="Text to Speech API:").grid(row=0, column=0, sticky='W')
        ttk.Label(frame, text="AWS Polly").grid(row=1, column=0, sticky='W', pady=(0, 10))
        ttk.Label(frame, text="Tone Analyser API:").grid(row=2, column=0, sticky='W')
        ttk.Label(frame, text="IBM Watson").grid(row=3, column=0, sticky='W')
        frame.pack(side=tk.TOP, fill=tk.BOTH, padx=10, pady=10)

        close_btn = ttk.Button(self, text='Close', command=self.on_close_btn_click)
        close_btn.pack(padx=5, pady=5, side=tk.BOTTOM)

        self.update_idletasks()
        w = self.winfo_width()
        h = self.winfo_height()
        x = (self.winfo_screenwidth() - w) // 2
        y = (self.winfo_screenheight() - h) // 2
        self.geometry('{}x{}+{}+{}'.format(w, h, x, y))
        self.grab_set()

    def on_close_btn_click(self):
        self.destroy()


class ToneAnalysisDialog(tk.Toplevel):
    def __init__(self, parent, client, tone):
        super(ToneAnalysisDialog, self).__init__(parent)

        self.parent = parent
        self.client = client
        self.tone = tone

        self.title('Tone Analysis')

        plot_widget = self.create_tone_analyse_plot()
        plot_widget.pack(side=tk.TOP, fill=tk.BOTH, expand=1)

        close_btn = ttk.Button(self, text='Close', command=self.on_close_btn_click)
        close_btn.pack(padx=5, pady=5, side=tk.BOTTOM)
        self.grab_set()

    def create_tone_analyse_plot(self):
        emotion_tone = {}
        language_tone = {}
        social_tone = {}

        for cat in self.tone['document_tone']['tone_categories']:
            print('Category:', cat['category_name'])
            if cat['category_name'] == 'Emotion Tone':
                for tone in cat['tones']:
                    print('-', tone['tone_name'], tone['score'])
                    emotion_tone.update({tone['tone_name']: tone['score']})
            if cat['category_name'] == 'Social Tone':
                for tone in cat['tones']:
                    print('-', tone['tone_name'], tone['score'])
                    social_tone.update({tone['tone_name']: tone['score']})
            if cat['category_name'] == 'Language Tone':
                for tone in cat['tones']:
                    print('-', tone['tone_name'], tone['score'])
                    language_tone.update({tone['tone_name']: tone['score']})

        max_tone_values = list(emotion_tone.values()) + list(language_tone.values()) + list(social_tone.values())
        if max(max_tone_values) > 0.9:
            max_tone_value = 1
        else:
            max_tone_value = max(max_tone_values) + 0.1

        mpl.style.use('seaborn')

        fig = mpl.figure.Figure(figsize=(7, 7))
        canvas = FigureCanvasTkAgg(fig, master=self)

        fig.suptitle(
            'Tones Analysis of Patient ID \'{}\', Chat Data for Session ID \'{}\'\nScale range: 0 (min) -- 1 (max)'
                .format(self.client['user_id'], self.client['session_id']), fontsize=14, fontweight='bold')

        keys = sorted(emotion_tone.keys(), reverse=True)
        values = [emotion_tone[key] for key in keys]
        y_pos = np.arange(len(values))
        ax1 = fig.add_subplot(311)
        ax1.barh(y_pos, values, align='center', alpha=0.6, color='limegreen')
        ax1.set_yticks(y_pos)
        ax1.set_yticklabels(keys)
        ax1.set_title('Emotion Tone', fontsize=12)
        ax1.set_xlim([0, max_tone_value])

        keys = sorted(social_tone.keys(), reverse=True)
        values = [social_tone[key] for key in keys]
        y_pos = np.arange(len(values))
        ax2 = fig.add_subplot(312)
        ax2.barh(y_pos, values, align='center', alpha=0.6, color='red')
        ax2.set_yticks(y_pos)
        ax2.set_yticklabels(keys)
        ax2.set_title('Social Tone', fontsize=12)
        ax2.set_xlim([0, max_tone_value])

        keys = sorted(language_tone.keys(), reverse=True)
        values = [language_tone[key] for key in keys]
        y_pos = np.arange(len(values))
        ax3 = fig.add_subplot(313)
        ax3.barh(y_pos, values, height=0.4, align='center', alpha=0.6, color='deepskyblue')
        ax3.set_yticks(y_pos)
        ax3.set_yticklabels(keys)
        ax3.set_title('Language Tone', fontsize=12)
        ax3.set_xlim([0, max_tone_value])

        fig.tight_layout(pad=0.9, w_pad=0.5, h_pad=1.7)
        fig.subplots_adjust(top=0.85, left=0.20)

        canvas.show()
        widget = canvas.get_tk_widget()

        return widget

    def on_close_btn_click(self):
        self.destroy()

if __name__ == "__main__":
    app = AppFrame()
    app.run()

This concludes this two-part series on building a simple GUI app in Python and Tkinter using AWS and IBM machine learning cloud services. Now you can see that anyone, with a little bit of elbow grease, minimal Python skills and little bit of time to spare (no PhD required!) can take advantage of these machine learning services and create something interesting.

http://scuttle.org/bookmarks.php/pass?action=add

Tags: , , , , , , , ,

This entry was posted on Wednesday, August 23rd, 2017 at 6:27 am and is filed under Uncategorized. You can follow any responses to this entry through the RSS 2.0 feed. You can leave a response, or trackback from your own site.

One Response to “Using AWS Polly And IBM Watson Text-To-Speech And Tone Analyser Artificial Intelligence Services To Read and Analyse Clinical Chat Data (Part 2)”

Martin Cropper April 15th, 2018 at 12:17 pm

Any chance you could do a short summary of how to simply integrate the output of a Watson conversation into AWS Polly on a dynamic basis – ie just have polly speak the output of a watson chatbot as it appears live?

Leave a Reply