
Implement the server#13

Merged
levilevente merged 12 commits into main from feature/implement-the-server
Dec 1, 2025

Conversation

@levilevente
Owner

This pull request introduces a new FastAPI-based web server for the project, enabling an HTTP endpoint for chat interactions with the agent. It also adds data transfer objects (DTOs) for request and response validation, updates dependencies, and refactors the CLI entry point. The most important changes are grouped below:

API Server Implementation:

  • Added a FastAPI server in server/serve.py, exposing a /chat endpoint that accepts chat questions and returns structured responses. The server is runnable via the new galacticview_app entry point.
  • Implemented the chat_ask_question service function in server/service.py, which interacts with the agent and formats the output using the new DTOs.

Data Transfer Objects (DTOs):

  • Introduced ChatTypeIn and ChatTypeOut models in server/dto/chat_type_in.py and server/dto/chat_type_out.py, respectively, and updated server/dto/__init__.py for easier imports. These models validate and structure the chat request and response data.
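
Based on the file descriptions in this PR (question and datetime in; title, content, and key_metrics out), the DTOs presumably look roughly like this. This is a sketch inferred from the review comments, not the merged code:

```python
from datetime import datetime
from pydantic import BaseModel, Field

# Field names inferred from the per-file descriptions below; treat as an approximation.
class ChatTypeIn(BaseModel):
    question: str
    date: datetime = Field(default_factory=datetime.now)

class ChatTypeOut(BaseModel):
    title: str
    content: str
    key_metrics: dict = Field(default_factory=dict)
```

Because both models are Pydantic BaseModels, FastAPI can use them directly for request validation and response serialization on the /chat endpoint.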

Dependency and Entry Point Updates:

  • Added fastapi[standard] to the dependencies in pyproject.toml and defined new CLI and app entry points (galacticview_cli and galacticview_app). The CLI entry point was refactored to point to galacticview_bot.cli:main.
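
In pyproject.toml terms, the entry points described above would look roughly like the fragment below. The galacticview_bot.cli:main path is named by the PR; the module path for the app entry point is an assumption:

```toml
[project.scripts]
galacticview_cli = "galacticview_bot.cli:main"
# Hypothetical module path -- the PR names only the script, not its target.
galacticview_app = "server.serve:main"
```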

Project Initialization:

  • Updated galacticview_bot/__init__.py to expose the app agent for use in the server and service layers.

RELATED ISSUE #6


Copilot AI left a comment


Pull request overview

This pull request implements a FastAPI-based web server for the GalacticView agent, enabling HTTP API access to the existing agent functionality. The implementation adds new server components, data transfer objects for request/response handling, and refactors the CLI to work alongside the new API.

Key Changes

  • FastAPI Server: Introduces a production-ready HTTP endpoint at /chat for interacting with the agent via REST API
  • Data Transfer Objects: Adds Pydantic models (ChatTypeIn, ChatTypeOut) for structured request validation and response formatting
  • Dual Entry Points: Refactors the project to support both CLI (galacticview_cli) and API server (galacticview_app) entry points

Reviewed changes

Copilot reviewed 7 out of 10 changed files in this pull request and generated 11 comments.

File Description
server/serve.py FastAPI application setup with /chat endpoint and uvicorn server configuration
server/service.py Business logic layer that processes chat requests by interacting with the agent
server/dto/chat_type_in.py Input DTO defining the structure for chat requests (question and datetime)
server/dto/chat_type_out.py Output DTO defining the structure for chat responses (title, content, key_metrics)
server/dto/__init__.py Package initialization exposing DTOs for easier imports
pyproject.toml Adds fastapi[standard] dependency and defines new CLI/app entry points
galacticview_bot/cli.py New CLI implementation moved from root, maintaining existing agent interaction logic
galacticview_bot/__init__.py Exports the agent app for use in server layer and adjusts logging setup formatting



Copilot AI left a comment


Pull request overview

Copilot reviewed 10 out of 12 changed files in this pull request and generated 12 comments.




Copilot AI left a comment


Pull request overview

Copilot reviewed 10 out of 12 changed files in this pull request and generated 16 comments.



Comment on lines +9 to +10
date: datetime = Field(default_factory=datetime.now)


Copilot AI Dec 1, 2025


The date field in ChatTypeIn is defined with Field(default_factory=datetime.now), but it's not being validated or used anywhere in the service layer. If the intention is to track when questions are asked, this should be renamed to clarify its purpose (e.g., timestamp or created_at), and the service should potentially log or store this information.

If this field is not needed, it should be removed to keep the API contract clean.

Suggested change
date: datetime = Field(default_factory=datetime.now)

    )

    try:
        thread_id = f"aerospace-agent-thread-{uuid.uuid4()}"

Copilot AI Dec 1, 2025


The thread ID generation uses a UUID which creates a new thread for every request. This means the agent has no conversation memory between requests - each question is treated as a completely new conversation.

If this is intentional (stateless API), consider documenting this behavior clearly. If conversation history is desired, you'll need to:

  1. Accept an optional thread_id in the request
  2. Store conversation state in a persistent backend (Redis, database, etc.)
  3. Return the thread_id in the response so clients can maintain context

The current implementation contradicts typical chat API patterns where context is maintained.
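
A minimal sketch of step 1 above, under the assumption that an optional thread_id field is added to the input model (the PR's actual ChatTypeIn has no such field, and resolve_thread_id is a hypothetical helper name):

```python
import uuid
from typing import Optional
from pydantic import BaseModel

class ChatTypeIn(BaseModel):
    question: str
    # Hypothetical optional field: lets a client continue an existing conversation.
    thread_id: Optional[str] = None

def resolve_thread_id(body: ChatTypeIn) -> str:
    # Reuse the client's thread so the agent keeps conversation memory,
    # or fall back to a fresh stateless thread per request.
    return body.thread_id or f"aerospace-agent-thread-{uuid.uuid4()}"
```

The response model would then echo the resolved thread_id back (step 3) so the client can send it on the next request.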

def chat_endpoint(request: Request, body: ChatTypeIn) -> ChatTypeOut:
    """
    Process chat questions using the agent and return structured responses.
    Rate limited to 7 requests per minute per IP.

Copilot AI Dec 1, 2025


[nitpick] The /chat endpoint lacks API documentation that would appear in FastAPI's automatic OpenAPI/Swagger docs. Consider adding a more detailed docstring with parameter descriptions and response examples:

@app.post("/chat")
@limiter.limit("7/minute")
def chat_endpoint(request: Request, body: ChatTypeIn) -> ChatTypeOut:
    """
    Process chat questions using the agent and return structured responses.
    
    Args:
        request: FastAPI request object (used for rate limiting)
        body: Chat input containing the user's question
        
    Returns:
        ChatTypeOut: Structured response with title, content, and key metrics
        
    Rate Limits:
        7 requests per minute per IP address
        
    Example:
        POST /chat
        {
            "question": "What is the latest news about Mars rovers?"
        }
    """

This improves API discoverability and developer experience.

Suggested change
    Rate limited to 7 requests per minute per IP.

    Parameters:
        request (Request): FastAPI request object (used for rate limiting).
        body (ChatTypeIn): Chat input containing the user's question.

    Returns:
        ChatTypeOut: Structured response containing the answer, title, and key metrics.

    Rate Limits:
        7 requests per minute per IP address.

    Example request:
        POST /chat
        {
            "question": "What is the latest news about Mars rovers?"
        }

    Example response:
        {
            "title": "Latest News About Mars Rovers",
            "content": "NASA's Perseverance rover recently discovered...",
            "metrics": {
                "relevance": 0.98,
                "sources": ["nasa.gov", "space.com"]
            }
        }


app.add_middleware(
    CORSMiddleware,
    allow_origins=["*"],

Copilot AI Dec 1, 2025


The CORS middleware is configured with allow_origins=["*"], which allows requests from any origin. While this may be acceptable for development, it poses a security risk in production environments. Consider restricting this to specific allowed origins, especially since the README mentions this is intended to integrate with a specific frontend application.

Recommendation:

allow_origins=os.getenv("ALLOWED_ORIGINS", "*").split(",")

This allows configuration via environment variable while defaulting to "*" for development.

Suggested change
allow_origins=["*"],
allow_origins=os.getenv("ALLOWED_ORIGINS", "*").split(","),

Comment on lines +53 to +54
uvicorn.run("server.serve:app", host=host, port=8000, reload=reload)


Copilot AI Dec 1, 2025


The server configuration binds to 0.0.0.0 in production mode, which is correct for containerized deployments. However, there's no configuration for:

  1. Port configuration: The port is hard-coded to 8000. This should be configurable via environment variable:

    port = int(os.getenv("PORT", "8000"))
  2. Worker processes: For production deployments, you typically want multiple worker processes to handle concurrent requests. Consider documenting the need to use a process manager like gunicorn or adding workers configuration:

    workers = int(os.getenv("WORKERS", "1"))
  3. SSL/TLS: No mention of HTTPS configuration. While this might be handled by a reverse proxy, it should be documented.

Suggested change
uvicorn.run("server.serve:app", host=host, port=8000, reload=reload)
# Port and worker configuration via environment variables
port = int(os.getenv("PORT", "8000"))
workers = int(os.getenv("WORKERS", "1"))
# Note: For production deployments, SSL/TLS should be handled by a reverse proxy (e.g., nginx, traefik).
if env == "prod":
    uvicorn.run("server.serve:app", host=host, port=port, workers=workers)
else:
    uvicorn.run("server.serve:app", host=host, port=port, reload=reload)

@levilevente levilevente mentioned this pull request Dec 1, 2025
@levilevente levilevente merged commit e21dd3a into main Dec 1, 2025
4 checks passed
@levilevente levilevente linked an issue Dec 14, 2025 that may be closed by this pull request
