# Discord Bot using pgml Python SDK, Langchain, Instructor-xl, and Falcon 7B

In this tutorial, we will build a Discord bot that can use markdown files to help answer user inquiries. We will ingest the files, convert their contents into vector embeddings, and save them to Postgres. After indexing the data, the bot will query the collection to retrieve the documents that are most likely to answer the user's question. Then, we will use a simple SQL query utilizing PostgresML to retrieve a completion from the open source Falcon-7B-Instruct text generation model. Finally, we will return this completion to the user in the Discord channel. We will be using the [pgml python SDK](https://pypi.org/project/pgml/) to simplify the process.
In this project, we will be working with three files:

To create a Discord bot, you will need to create a Discord bot account and get a token.
Next, set the name of the Discord channel you would like the bot to listen to. Set this to the variable `DISCORD_CHANNEL` in your .env file.

We will be using the pgml Python SDK to create, store, and query our vectors. So, if you don't already have an account there, you can create one here: https://postgresml.org/. You can select the free serverless option and will be given a connection string. Set this connection string to the variable `pgml_CONNECTION_STR` in your .env file.
Next, you will want to add the markdown files you would like to use into the `./content` folder. Set the path to this folder to the variable `CONTENT_PATH` in your .env file.
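
With everything configured, a .env file would look something like this (all values are placeholders, and the token variable name is an assumption):

```
DISCORD_TOKEN=your-bot-token
DISCORD_CHANNEL=your-channel-name
pgml_CONNECTION_STR=postgres://user:password@host:port/dbname
CONTENT_PATH=./content
```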

Open and run the cells in the `./ingest.ipynb` notebook. If you have set all of the environment variables correctly, the notebook will ingest and index your documents.
Let's take a look at what is happening in the notebook.

1. We load in the markdown files from the path we passed in, using Langchain's document loader.
2. We convert this array of documents to an array of dictionaries in the format expected by the pgml SDK.

```
docs = [{"text": "foo"}, {"text": "bar"}, ...]
```

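For illustration, building that list from a folder of markdown files can be sketched in plain Python (this helper is an assumption for illustration, not the notebook's exact code):

```python
from pathlib import Path

def markdown_to_docs(content_path):
    # Wrap each markdown file's text in the dictionary shape shown above
    docs = []
    for md_file in sorted(Path(content_path).glob("*.md")):
        docs.append({"text": md_file.read_text()})
    return docs
```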
3. We create a pgml collection and upsert those documents into it.
4. We chunk those documents into smaller sizes and embed those chunks using the Instructor-XL model.

```
collection.generate_chunks()
collection.generate_embeddings()
```

Now that our data is properly indexed, we can start our bot server to handle incoming requests, using the data we just ingested to help answer questions.

For our bot server, we are using the popular library [discord.py](https://discordpy.readthedocs.io/).

To start the bot server, you can run the following command in your terminal:

```
python start.py
```

If everything was set up correctly in earlier steps, your bot should be fully functional.

But since it's good to know how things are working, let's take a look at the code.

In the `start.py` file, the code initializes the bot class with your PostgreSQL connection string and then starts the Discord bot with your Discord token and the name of the collection you saved your data to in the previous step.
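
A minimal sketch of what such a bot class might look like (the class and method names here are assumptions, not the repository's exact code):

```python
class PGMLBot:
    """Hypothetical wrapper mirroring the description above."""

    def __init__(self, connection_str):
        # Store the PostgresML connection string for later queries
        self.connection_str = connection_str

    def start(self, collection_name, discord_token):
        # Remember which collection to search, then hand control to discord.py
        self.collection_name = collection_name
        self.discord_token = discord_token
        # In the real bot, this is where discord.py's client.run(token) is invoked.
```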
We also declared the `on_message` function that is called when a message is sent in the channel.

When a message is handled by this `on_message` function, we do a few things:

1. Using the pgml SDK, we run:

```
collection.vector_search(
    query,
    top_k=3,
    model_id=2,
    splitter_id=2,
    query_parameters={"instruction": "Represent the question for retrieving supporting documents: "},
)
```

This is going to return the top 3 documents that are most similar to the user's message.
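
Under the hood, a vector search like this embeds the question and ranks stored chunks by similarity; a toy version of that ranking in plain Python (the vectors involved are made up for illustration):

```python
def top_k_similar(query_vec, chunk_vecs, k=3):
    # Score each chunk embedding by cosine similarity to the query embedding
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm = lambda v: sum(x * x for x in v) ** 0.5
        return dot / (norm(a) * norm(b))

    ranked = sorted(range(len(chunk_vecs)),
                    key=lambda i: cosine(query_vec, chunk_vecs[i]),
                    reverse=True)
    return ranked[:k]  # indices of the k most similar chunks
```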

2. We then concatenate the text of those documents into a single string and add it to our prompt text, which looks like:

```
Answer the question as truthfully as possible using the provided text, and if the answer is not contained within the text below, say "I don't know!"

Context:
{context}

QUESTION<<{message_content}
ANSWER<<
```


3. Now that we have our prompt ready, we can make a Falcon completion. We will get this completion by executing a SQL query that uses the `pgml.transform` function.
4. Now that we have the response from Falcon, we need to clean the response text up a bit before returning the bot's answer. Since the completion text includes the original prompt, we will remove that from the generated text in the `prepare_response` function.
5. Finally, we will send the response back to the Discord channel.
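
Step 3 can be sketched as a parameterized query; the task payload and model name below are assumptions based on the tutorial's description, not the repository's exact SQL:

```python
def falcon_completion_sql():
    # SQL that asks PostgresML to run text generation in-database;
    # the prompt itself is bound separately as a query parameter.
    return (
        "SELECT pgml.transform("
        "task => '{\"task\": \"text-generation\", "
        "\"model\": \"tiiuae/falcon-7b-instruct\"}'::JSONB, "
        "inputs => ARRAY[%s]) AS completion"
    )
```

The returned string would then be executed with a Postgres driver such as psycopg, binding the prompt text to the `%s` placeholder.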

```
return f"""Answer the question as truthfully as possible using the provided text, and if the answer is not contained within the text below, say "I don't know my lord!"

Context:
{context}

QUESTION<<{message_content}
ANSWER<<"""
```

```
# Prepare the bot's response by removing the original prompt from the generated text
```
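
A minimal sketch of the behavior that comment describes (the exact signature is an assumption):

```python
def prepare_response(generated_text, prompt):
    # Falcon's completion echoes the original prompt, so strip it before replying
    if generated_text.startswith(prompt):
        return generated_text[len(prompt):].strip()
    return generated_text.strip()
```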