Database Service¶
PostgreSQL connection management, schema introspection, and data import.
Classes¶
DatabaseServiceError
¶
Bases: Exception
Base exception for database service errors.
ConnectionExpiredError
¶
Bases: DatabaseServiceError
Raised when a connection handle has expired.
TableNotFoundError
¶
Bases: DatabaseServiceError
Raised when a table is not found.
InvalidColumnError
¶
Bases: DatabaseServiceError
Raised when an invalid column is referenced.
QuerySafetyError
¶
Bases: DatabaseServiceError
Raised when a query violates safety constraints.
Functions¶
connect(conn)
async
¶
Test connection and create a handle if successful.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
conn
|
DatabaseConnectionRequest
|
Database connection request |
required |
Returns:
| Type | Description |
|---|---|
tuple[str, str | None]
|
Tuple of (handle, version) where version is the database version string |
Raises:
| Type | Description |
|---|---|
DatabaseServiceError
|
If connection fails |
Source code in backend/app/services/database_service.py
list_tables(handle)
async
¶
List tables with estimated row counts.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
Returns:
| Type | Description |
|---|---|
list[TableInfo]
|
List of table information |
Raises:
| Type | Description |
|---|---|
ConnectionExpiredError
|
If handle is invalid or expired |
Source code in backend/app/services/database_service.py
get_schema(handle, table)
async
¶
Get column schema and sample values for a table.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
table
|
TableIdentifier
|
Table identifier |
required |
Returns:
| Type | Description |
|---|---|
TableSchemaResponse
|
Table schema response with columns and sample values |
Raises:
| Type | Description |
|---|---|
ConnectionExpiredError
|
If handle is invalid or expired |
TableNotFoundError
|
If table doesn't exist |
Source code in backend/app/services/database_service.py
get_distinct_values(handle, table, column, limit=100)
async
¶
Get distinct values for a column (for filter dropdowns).
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
table
|
TableIdentifier
|
Table identifier |
required |
column
|
str
|
Column name |
required |
limit
|
int
|
Maximum values to return |
100
|
Returns:
| Type | Description |
|---|---|
list[str]
|
List of distinct values as strings |
Raises:
| Type | Description |
|---|---|
ConnectionExpiredError
|
If handle is invalid or expired |
InvalidColumnError
|
If column doesn't exist |
Source code in backend/app/services/database_service.py
preview_data(handle, table, mappings, filters=None, limit=10)
async
¶
Preview data with column mappings applied.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
table
|
TableIdentifier
|
Table identifier |
required |
mappings
|
list[ColumnMapping]
|
Column mappings to apply |
required |
filters
|
list[FilterCondition] | None
|
Optional filter conditions |
None
|
limit
|
int
|
Number of rows to preview |
10
|
Returns:
| Type | Description |
|---|---|
list[dict[str, Any]]
|
List of dictionaries with mapped column names |
Raises:
| Type | Description |
|---|---|
ConnectionExpiredError
|
If handle is invalid or expired |
InvalidColumnError
|
If a mapped column doesn't exist |
Source code in backend/app/services/database_service.py
import_data(handle, table, mappings, filters=None, limit=10000, dedupe_on_id=True)
async
¶
Import data from database with column mappings.
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
table
|
TableIdentifier
|
Table identifier |
required |
mappings
|
list[ColumnMapping]
|
Column mappings to apply |
required |
filters
|
list[FilterCondition] | None
|
Optional filter conditions |
None
|
limit
|
int
|
Maximum rows to import |
10000
|
dedupe_on_id
|
bool
|
Whether to deduplicate by id column |
True
|
Returns:
| Type | Description |
|---|---|
list[dict[str, Any]]
|
List of dictionaries with mapped column names |
Raises:
| Type | Description |
|---|---|
ConnectionExpiredError
|
If handle is invalid or expired |
InvalidColumnError
|
If a mapped column doesn't exist |
Source code in backend/app/services/database_service.py
356 357 358 359 360 361 362 363 364 365 366 367 368 369 370 371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 428 429 430 431 432 433 434 435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 | |
execute_query(handle, query, limit=10, timeout_ms=60000)
async
¶
Execute an arbitrary SELECT query with safety guards.
Safety layers: 1. Session-level read-only mode 2. Session-level statement timeout 3. Single-statement enforcement (done at schema validation) 4. LIMIT appended to query
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
query
|
str
|
SQL query (already validated by schema) |
required |
limit
|
int
|
Maximum rows to return |
10
|
timeout_ms
|
int
|
Statement timeout in milliseconds |
60000
|
Returns:
| Type | Description |
|---|---|
list[dict[str, Any]]
|
List of result dictionaries |
Source code in backend/app/services/database_service.py
preview_data_all_columns(handle, table, filters=None, limit=10)
async
¶
Preview all columns from a table (no mapping step).
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
table
|
TableIdentifier
|
Table identifier |
required |
filters
|
list[FilterCondition] | None
|
Optional filter conditions |
None
|
limit
|
int
|
Number of rows to preview |
10
|
Returns:
| Type | Description |
|---|---|
list[dict[str, Any]]
|
List of raw dictionaries with all columns |
Source code in backend/app/services/database_service.py
import_data_all_columns(handle, table, filters=None, limit=10000, dedupe_on_id=True)
async
¶
Import all columns from a table (no mapping step).
Parameters:
| Name | Type | Description | Default |
|---|---|---|---|
handle
|
str
|
Connection handle |
required |
table
|
TableIdentifier
|
Table identifier |
required |
filters
|
list[FilterCondition] | None
|
Optional filter conditions |
None
|
limit
|
int
|
Maximum rows to import |
10000
|
dedupe_on_id
|
bool
|
Whether to deduplicate by dataset_id or id column |
True
|
Returns:
| Type | Description |
|---|---|
list[dict[str, Any]]
|
List of raw dictionaries |
Source code in backend/app/services/database_service.py
571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626 627 628 629 630 631 632 633 634 635 636 637 638 639 640 641 642 643 644 645 646 647 648 649 650 651 652 653 654 655 656 657 658 659 660 661 662 663 664 665 666 667 668 669 | |