presenton/servers/fastapi/ppt_generator/generator.py

96 lines
6.4 KiB
Python

import os
from typing import AsyncIterator, List
from langchain_google_genai import ChatGoogleGenerativeAI
from langchain_openai import ChatOpenAI
from langchain_core.messages import SystemMessage, HumanMessage, AIMessageChunk
from ppt_generator.models.llm_models import LLMPresentationModel
CREATE_PRESENTATION_PROMPT = """
You're a professional presenter with years of experience in creating clear and engaging presentations.
Create a presentation using the provided slide titles, images, and additional data, following specified steps and guidelines.
Analyze all inputs, including slide titles, graphs, summary, big idea, story and spreadsheet content to construct each slide with appropriate content and format.
# Slide Types
- **1**: contains title, description and image.
- **2**: contains title and list of items.
- **4**: contains title and list of items with images.
- **5**: contains title, description and a graph.
- **6**: contains title, description and list of items.
- **7**: contains title and list of items with icons.
- **8**: contains title, description and list of items with icons.
# Steps
1. Analyze Prompt, and other provided data.
2. Use Slide titles provided in **Titles**.
3. Generate Slide Content for each slide. Make sure it has all the context and information required to create this individual slide from.
4. Select slide type.
5. Output should be in json format as per given schema.
6. **Adherence to schema should be beyond all the rules mentioned in notes.**
# Notes
- Generate output in language mentioned in *Input*.
- Distribute contexts mentioned in prompt to slides using **info** field.
- User prompt should be respected beyond all rules or constraints.
- If the presentation is academic, then make only take the chapter text as context and create presentation according to that text and structure. Don't assume or put text or context which is not in the text.
- If **Story** is provided, presentation should follow the story flow.
- When you have to express single numbers like percentage or figures, you should use inforgraphics but for a collection of numbers in series you can use charts.
- Freely select type with images and icons.
- Introduction and Conclusion should have *Type 1* if graph is not assigned.
- Try to select **different types for every slides**.
- Don't select Type **3** for any slide.
- Make sure to give presentation in said language. You must translate and understand given context and text is in any other language.
- Do not include same graph twice in presentation without any changes to the other.
- Every series in a graph should have data in same unit. Example: all series should be in percentage or all series should be in number of items.
- Type **9** and **5** should be only picked if graph is available.
- **Strictly keep the text under given limit.**
- For slide content follow these rules:
- Highlighting in markdown format should be used to emphasize numbers and data.
- Adhere to length contraints in **body** and **description**. Focus on direct communication within character constrainsts than lengthy explanation.
- **body** and **description** in slides should never exceed character limits of 200 characters.
- Specify **don't include text in image** in image prompt.
- All the numbers should be bolded with **bold** tag in body or description of slide.
- Image prompt should cleary define how image should look like.
- Image prompt should not ask to generate **numbers, graphs, dashboard and report**.
- Examples of image prompts:
- a travel agent presenting a detailed itinerary with photos of destinations, showcasing specific experiences, highlighting travel highlights
- a person smiling while traveling, with a beautiful background scenery, such as mountains, beach, or city, golden hour lighting
- a humanoid robot standing tall, gazing confidently at the horizon, bathed in warm sunlight, the background showing a futuristic cityscape with sleek buildings and flying vehicles
- Descriptions should be clear and to the point.
- Descriptions should not use words like "This slide", "This presentation".
- If **body** contains items, *choose number of items randomly between mentioned constraints.*
- **Icon queries** must be a generic **single word noun**.
- Provide 3 icon query for each icon where,
- First one should be specific like "Led bulb".
- Second one should be more generic that first like "bulb".
- Third one should be simplest like "light".
**Go through notes and steps and make sure they are all followed. Rule breaks are strictly not allowed.**
"""
def generate_presentation_stream(
titles: List[str],
prompt: str,
n_slides: int,
language: str,
summary: str,
) -> AsyncIterator[AIMessageChunk]:
schema = LLMPresentationModel.model_json_schema()
system_prompt = f"{CREATE_PRESENTATION_PROMPT} -|0|--|0|- Follow this schema while giving out response: {schema}. Make description short and obey the character limits. Output should be in JSON format. Give out only JSON, nothing else."
system_prompt = SystemMessage(system_prompt.replace("-|0|-", "\n"))
user_message = f"Prompt: {prompt}-|0|--|0|- Number of Slides: {n_slides}-|0|--|0|- Presentation Language: {language} -|0|--|0|- Slide Titles: {titles} -|0|--|0|- Reference Document: {summary}"
user_message = HumanMessage(user_message.replace("-|0|-", "\n"))
model = (
ChatOpenAI(model="gpt-4.1")
if os.getenv("LLM") == "openai"
else ChatGoogleGenerativeAI(model="gemini-2.0-flash")
)
return model.astream([system_prompt, user_message])