Mar 7, 2025
Smarter Deployments with Completions UI

We've enhanced the Deploy Page with a completions UI, making it easier and faster to interact with deployment configurations. This update improves user experience by offering better guidance and reducing friction when setting up deployments.
Better API Recommendations
Our API Recommendation System is now smarter! We added in-memory vector similarity to function_call.py
, improving the way we suggest APIs based on past interactions. This makes our system more efficient at delivering relevant recommendations.
Cosmos DB Integration
We've integrated Cosmos DB queries, making it easier to work with large-scale, distributed databases. This enhancement allows for seamless querying and better data management, improving overall system performance.

Chat Completions API Compatibility
We introduced a chat completions-compatible endpoint, aligning with OpenAI’s API format. This makes it easier for developers to integrate our chat-based functionalities with existing AI-driven workflows.
Fixes & Improvements
API & Infrastructure
Added Milvus as a vector store backend.
Exposed the Live API directly in the Deploy Page.
Added
webui-tools.json
andtools.py
as new API endpoints.
User Experience & Bug Fixes
Fixed SQL data object creation when selecting a table instead of writing raw SQL.
Ensured the Validate button matches the Connect button color for visual consistency.
Addressed a "No datasources connected" error in the Deploy Page.
Improved schema slot handling for query programs, simplifying developer workflows.
Security & Deployment Enhancements
Reconfigured Cloudflare SSL, ensuring secure deployments.
Created stronger config maps for each customer deployment.
Automated deployment of frontend, backend, and database backups for each customer.
Feb 21, 2025
PostgreSQL Connectivity & Performance Upgrades
We’re excited to announce that Infactory now supports direct connections to PostgreSQL databases! This unlocks a world of possibilities, making it easier to query, analyze, and integrate your data seamlessly. Along with this, we’ve introduced a Custom SQL tab on the PostgreSQL connection page, giving you more flexibility in writing and running SQL queries directly.

Additionally, we've made improvements to query autogeneration when connecting to PostgreSQL, streamlining the experience for users who want quick insights without manually crafting queries.
Fixes & Improvements
Performance Optimization
Updated CloudWatch dashboards & alarms for better monitoring of our infrastructure.
Switched runner hosts & backend API instances to more cost-effective AWS instance types.
Enabled dynamic instance selection in Terraform to optimize costs (e.g., Spot instances for development).
Implemented automatic Docker image pruning for self-hosted runners to keep environments clean.
UX/UI Enhancements
Scrollable query lists for a smoother experience.
Aligned query editor and chat window styles for consistency.
Refined color scheme in the query editor to match Infactory’s design.
Deploy button fixes – resets state after clicking and is now positioned more intuitively.
Bug Fixes
Fixed S3 chunking issue to optimize large file uploads.
Resolved CSV upload flow issues – improving performance and security.
Fixed query execution errors that caused subsequent runs to fail.
Addressed progressive chart breakage and improved auto-generated query quality.
With PostgreSQL integration and a host of performance upgrades, Infactory is now more powerful and efficient than ever. Let us know what you think! 🚀
Feb 16, 2025
Improved User Experience and Robust Security Enhancements
Feature Updates
Project Management: Enhanced Editing and Syncing
Streamline your project management with new editing capabilities. Users can now effortlessly update project details, with the added benefit of conversation syncing for seamless transitions between projects. The new ProjectForm
component simplifies the creation and editing processes, boosting productivity and user satisfaction.
Advanced Authentication with RBAC and Casbin
Upgrade your security protocols with role-based access control (RBAC) and Casbin integration. This new feature enhances security by managing user permissions effectively, ensuring that only authorized users can access specific API endpoints.
Enhancements & UX Improvements
Data Visualization: Customizable Charting
Experience improved data visualization with live chart customization. Users can now toggle series data on and off, allowing for more personalized and meaningful data exploration.
Enhanced Code Editor Usability
Enjoy a more intuitive coding experience with enhancements to the Code Editor. Unwanted text patterns are now automatically cleaned, and horizontal scrolling is introduced for better readability and usability.
File Upload Improvements
The FileUpload component now restricts uploads to CSV files only, ensuring data consistency. Users will receive toast notifications for any errors, enhancing the overall user experience.
Performance & Reliability
Panel Size Management for Consistent Layouts
Manage your workspace efficiently with the new panel size management feature. This update ensures consistent panel resizing across various layouts, providing a more organized and user-friendly interface.
Improved DataFrame Caching
Boost performance with local DataFrame caching. This enhancement reduces loading times, ensuring that data is readily available and improving overall application responsiveness.
Bug Fixes
Resolved an issue where downloading charts was not functioning correctly, ensuring reliable data export capabilities.
Fixed a problem with authentication in development environments, re-enabling user authentication requirements for improved security.
Corrected a bug that prevented proper chart rendering in the DataFrame Renderer, ensuring accurate and reliable data visualization.
Feb 14, 2025
A More Reliable and Intuitive Infactory

This cycle, we focused on making Infactory more stable, secure, and user-friendly. We improved authentication, enhanced error handling, and fixed key UI/UX issues. Additionally, we made major improvements to our API and deployment processes to ensure smoother workflows.
Highlights
Role-Based Authentication (RBAC)
We introduced role-based authentication to enforce access control. Each user now has anorg_admin
role by default, limiting CRUD operations to their organization and preventing cross-organization data access.Smarter Page Titles
Instead of falling back to a generic phrase like "Find knowledge in your data", pages now use more descriptive and relevant titles.Improved Cloud Deployment
Provisioned S3 buckets for deployment logs, improving logging and debugging for on-premise installations.
Migrated Cloudflare DNS management to Terraform, allowing for more structured and reproducible infrastructure updates.
Landing Page Revamp
We launched a new Landing Page, refining the first touchpoint with users and ensuring a smoother onboarding experience.
Fixes & Improvements
User Experience & Interface
Fixed Infactory Assistant overflowing the screen in the Build page.
Removed unnecessary console logs in chat, significantly improving performance.
Syntax highlighter no longer re-renders on every keystroke, reducing lag.
The query program editor now removes redundant code markers on first insertion.
Deployment & Infrastructure
Updated CloudWatch alarms with longer-duration triggers for more reliable alerting.
Fixed an intermittent network error affecting Docker builds.
Resolved authentication issues in the dev environment to restore access control.
Prevented running/deploying an empty query program, ensuring valid input.
Improved API routing and request handling for better performance.
Data & Query Handling
Fixed an issue where deployed query program buttons disappeared after deployment.
Improved metadata extraction from query programs for better query generation.
Fixed chart downloads, ensuring users can properly export visualized data.
Added live chart customization options, giving users more control over how data is displayed.
Security & Access Control
Fixed Invalid AZP Claim errors on authentication, ensuring smoother logins.
Removed all outdated tests from GitHub since they were no longer functional.
Other Improvements
Updated README with a Quick Start Guide for easier setup.
Renamed CRUD directories and components to better reflect their purpose.
Integrated APIGen Dataset for enhanced API recommendation functionality.
Thanks for your continued feedback! More exciting updates are on the way 🚀.
Feb 9, 2025
Enhanced API Flexibility and Frontend Refinements
Feature Updates
Fivetran Integration: Expanding Data Connectivity
We've introduced Fivetran integration into the frontend, significantly enhancing your data integration capabilities. This allows for seamless data synchronization and management across your projects. Now, you can set default sizes for ResizablePanel components, providing a more intuitive and flexible user experience.
Improved Explore Experience: More Intuitive Data Interactions
Enhancements have been made to the explore page, including the addition of a fallback function to improve UI interactions and rendering of tables and charts. These changes ensure a smoother, more reliable exploration experience.
API Enhancements: Better Routing and Flexibility
We've updated our API to support both slashed and unslashed URL paths, enhancing flexibility and consistency in API usage. This change simplifies routing and improves user experience by allowing more intuitive API interactions.
Enhancements & UX Improvements
Streamlined Message Handling
We have improved message handling by introducing error checks and removing redundant message entries. This ensures a cleaner and more efficient chat history, allowing you to focus on meaningful interactions.
Refined Example Programs
Our example programs have been updated with new fixes and enhancements, ensuring they are more comprehensive and easier to use. This update includes re-generated examples for baseball and Posthog integrations, providing better insights and learning opportunities.
Performance Optimization: Cleaner Console Logs
Unnecessary console log statements have been removed, reducing log clutter and improving performance. This results in a faster and more responsive streaming text experience.
Bug Fixes
Resolved an issue where CSV uploads were not sending data to S3, improving data upload reliability.
Fixed JSON formatting bugs to ensure data integrity and prevent processing errors.
Addressed issues with conversation history not displaying completely, ensuring all relevant data is visible.
Developer Experience
Enhanced SDK Error Handling
Improved error handling in our SDK now passes specific 4xx errors back to the frontend, allowing for more precise debugging and user feedback. This ensures that issues are caught and addressed more efficiently, enhancing overall reliability.
Codebase Optimization
Various refactoring efforts have been undertaken to streamline our codebase, including the removal of redundant routes and unused code. These improvements aid in maintainability and readability, making it easier for developers to work with the code.
These changes aim to enhance your experience by providing more robust features, improving interaction quality, and ensuring smoother operation across our platform.
Feb 2, 2025
Enhanced Fivetran Integration and Improved Query Interface
Feature Updates
Fivetran Integration: Seamless Data Connectivity
The integration with Fivetran has been significantly enhanced, enabling seamless data connectivity directly within your dashboard. This update includes the addition of connector detail pages and improved API routes, providing users with a more intuitive and efficient data management experience. The integration also supports real-time status updates and enhanced error handling, ensuring smoother operations and better data visibility.
Query Program Enhancements: Streamlined Execution and Management
The query program interface has been upgraded to support execution time tracking and refined result handling. These improvements simplify query execution and enhance user experience by providing more detailed feedback and better management of query states. Additionally, a new context menu for query tabs has been introduced, offering flexible tab management options for a clutter-free workspace.
Enhancements & UX Improvements
Custom Formatting and Data Model Updates
The editor now supports custom formatting for new data types, bringing back the accept/reject button experience for a more interactive and user-friendly interface. Data models have been updated to include new fields, improving data handling and integration consistency across the platform.
Performance & Reliability
Cleaner Shutdowns and Optimized Data Retrieval
A new trap has been added for cleaner shutdowns, ensuring system stability and integrity during operations. SQL queries have been optimized for improved readability and performance, enhancing data retrieval processes and overall application reliability.
Bug Fixes
Fixed data source upload issues by deprecating outdated column specifications, ensuring smoother data handling.
Resolved a bug in the chat graph that improved explanation generation, enhancing functionality on the explore page.
Developer Experience
Improved API and Code Management
The API management system has been upgraded to include optimistic updates and better state integration, streamlining developer workflows. Code refactoring across various components has improved readability and maintainability, creating a more robust development environment.
Jan 26, 2025
Enhanced Chat Functionality and Query Improvements
Feature Updates
Enhanced Chat Functionality with Explanations
We've added a new component to the chat system that provides detailed explanations for function responses. This enhancement ensures that users have clear context and understanding of the chat interactions, significantly improving the user experience when engaging with automated tools.
PostHog Query Deployment and Enhancements
A comprehensive set of queries has been deployed for PostHog, enhancing data analysis capabilities. These changes include updates to funnel queries and improvements in test accuracy. Note that the new queries require regeneration due to non-backwards compatible changes, ensuring more robust data insights.
Enhancements & UX Improvements
Improved Vertical Scroll in Build Pane
We've implemented vertical scrolling within the build pane chat answer section, enhancing navigability and user interaction with chat responses, particularly useful during in-depth discussions and long responses.
Dashboard Integration in App Sidebar
The in-app sidebar now features a dedicated dashboard, providing quick access to key metrics and navigation options. This update streamlines user interaction with core app functionalities, boosting productivity and ease of use.
Bug Fixes
Corrected a spelling inconsistency from "explaination" to "explanation" across multiple files, enhancing clarity and consistency in the chat module.
Addressed a glitch in the
test_stocks_general_005.py
where tests were erroneously stuck in a "smoke test" mode, ensuring smoother test operations.Fixed issues with PostHog queries involving sliding time windows, resolving errors and improving query reliability.
Developer Experience
Code Cleanup and Refactoring
Significant code cleanups and refactoring were undertaken to improve maintainability and readability of the chat components and related API routes. This includes streamlining message rendering logic and enhancing the format_message_endpoint function, making it easier for developers to navigate and extend the codebase.
Jan 19, 2025
Enhanced Explore Page and Improved Query Handling
Feature Updates
Explore Page Revamp for Beta Launch
The Explore page has undergone significant updates in preparation for its beta launch. This includes improved chat functionality with enhanced conversation context, real-time status updates, and a more intuitive layout. Users will experience smoother interactions and better data representation, thanks to the integration of a conversation graph model.
Multiple Dataframe Query Support
Queries now support multiple dataframes, enhancing the flexibility and power of data analysis. This new feature allows users to execute complex queries across various data sources, making the platform more robust and versatile.
Improved Slot Filling with Generous Acceptance
The system now generously accepts array-like strings and taxon leaf values for slot filling, simplifying data input processes. This change ensures a more seamless integration of data, reducing errors and improving overall efficiency.
Enhancements & UX Improvements
Enhanced Loading Animations
Loading animations have been refined for a more visually appealing user experience. This improvement helps users better understand the system's processing status, reducing uncertainty during wait times.
Frontend Interface Tweaks
Several small frontend fixes have been implemented, including inline "Thinking..." badges and improved button aesthetics in chat components. These updates ensure a cleaner, more intuitive user interaction with the platform.
Console Logging Improvements
Console outputs have been streamlined by replacing unnecessary log statements with appropriate log levels. This cleanup enhances code readability and helps developers quickly identify potential issues during local deployments.
Bug Fixes
Resolved issues with stray text in posthog query examples, ensuring cleaner and more accurate query results.
Fixed glitches in the Explore page that previously affected data scrolling and rendering.
Addressed errors in the posthog query integration, ensuring smoother data visualization in AG Grid and AG Chart components.
These updates collectively enhance both the user and developer experience, making the platform more efficient, responsive, and user-friendly.
Jan 12, 2025
Robust Data Handling and Enhanced API Flexibility
Feature Updates
Improved Datetime Handling for Consistent Data Processing
We have standardized our approach to datetime handling across the system, ensuring uniformity and reducing errors in query operations. This change enhances data integrity and reliability, especially in complex data integrations.
Simplified Schema Management with Direct Dataset Naming
ColumnT and SchemaT names now leverage CSV filenames directly, providing clearer and more intuitive schema management. This change streamlines processes and reduces reliance on AI-generated names for improved clarity and usability.
Enhanced API Documentation Flexibility
API documentation can now dynamically determine host and scheme from request headers, making the documentation more adaptable and accurate across various deployment environments.
Enhancements & UX Improvements
Robust CSV Upload and Data Inference Capabilities
Our CSV upload process has been fortified with improved data inference, particularly for handling complex datasets like posthog data. This ensures more accurate data sampling and better performance in data handling tasks.
Improved Query Handling in Workshop Components
We've refactored various components in our workshop interface, enhancing message handling, formatting, and state management for a smoother user experience when working with complex queries.
Performance & Reliability
ConnectorX Dependency Integration
The addition of ConnectorX as a dependency improves data connectivity and transfer speeds, enhancing overall system performance for data-intensive tasks.
Dynamic Runnable Checks in Query Programs
Query programs now skip steps with unfilled slots, optimizing performance by avoiding unnecessary computations. This ensures that only fully prepared operations are executed, improving efficiency.
Bug Fixes
Resolved issues with OpenAPI documentation visibility in production by updating environment configurations and host parameter handling.
Fixed an error in ask_dataline.py to simplify error handling and improve system stability.
Jan 5, 2025
Enhanced Query Handling and Improved Job Management
Feature Updates
Robust Job Management System
A comprehensive job management system has been implemented, allowing better submission, status updates, and metrics tracking. This new feature streamlines job-related API requests and enhances the overall workflow for managing job processes.
Expanded Query Example Library
We've added a multitude of new query examples across datasets such as ads, coffee, stocks, and baseball. These examples are designed to improve the machine learning model's ability to generate accurate and relevant queries.
Enhancements & UX Improvements
Query Program Error Reporting
Basic error reporting has been introduced for failed query program generations. This enhancement helps users identify and troubleshoot issues more effectively, paving the way for smoother query development.
Improved Query Program Prompts
The query program generation process has been refined by fixing misleading prompts, reducing errors in generated queries. This change ensures more accurate query development for users.
Enhanced Table Formatting
We've made visual improvements to result tables, including width adjustments and better rounding of floats. These enhancements provide a cleaner and more readable presentation of data outputs.
Streamlined URL Handling in FastAPI
FastAPI apps now feature an updated configuration that prevents automatic redirection of URLs with trailing slashes. This change enhances URL management and improves user navigation experiences.
Bug Fixes
Resolved an issue where non-existent columns were being included in query programs due to confusing prompts, improving the accuracy of query generation.
Dec 29, 2024
Comprehensive Flow Cleanup and Enhanced Query Management
Feature Updates
Improved Query Deletion and Management
The soft delete capabilities for query programs have been enhanced with error handling and user feedback. Users can now unpublish query programs before deletion, ensuring smoother management and better control over their query lifecycle.
Enhanced Conversation Creation and Management
Introducing new functionality to create conversations directly from the Explore Layout. This update includes a streamlined interface for starting conversations, enhanced layout presentation, and a new icon to indicate a "no conversation" state, making user interactions more intuitive and engaging.
Simplified Structured Data Handling
The handling of structured data has been simplified by removing the "df_" prefix from query program methods and eliminating the use of confusing GeneralColumnT types. These changes streamline the query writing process, making it more intuitive and consistent.
Enhancements & UX Improvements
UI Consistency and Readability
Numerous improvements have been made to ensure a consistent and readable UI. This includes better handling of text areas post-navigation, improved spacing in components, and enhanced feedback messages during query actions. These changes contribute to a smoother and more cohesive user experience across the platform.
Layout and Usability Enhancements
The Explore Layout now features conversation filtering and selection functionalities, along with search inputs and tooltips for creating new conversations. These enhancements aim to improve accessibility and usability, allowing users to navigate and manage their conversations more effectively.
Bug Fixes
Addressed timing issues with textarea focus after navigation, improving user experience during data entry processes.
Resolved issues with incorrect item rendering in the renderer component, enhancing reliability and user feedback.
Fixed formatting issues and ensured consistent order in sidebar navigation, improving overall interface consistency.
Developer Experience
Type Safety and Compatibility Improvements
Updated TypeScript and related type definitions to the latest versions, improving type safety and compatibility across the project. This ensures better maintainability and reduced potential for runtime errors.
Codebase Cleanup and Optimization
Removed obsolete code and unused version history data, reducing clutter and improving repository maintainability. These optimizations contribute to a cleaner and more efficient development environment.
Dec 22, 2024
Enhanced Chat Features and Query Program Improvements
Feature Updates
Chat with Your Data: Screen and Flow Improvements
Interact with your data more smoothly with enhanced chat functionalities. The chat screen now better manages conversations, implements streaming responses for real-time communication, and supports tool calls, providing a more robust interactive experience.
Query Program Generation: Consistency and Optimization
We've reworked our query program generation for enhanced clarity and reduced redundancy. This update ensures that slot generation is more consistent, improving the overall performance of query handling.
Enhancements & UX Improvements
Slot Management: Enhanced Flexibility
Slot management received a significant upgrade, allowing for more consistent type validation and flexibility in slot creation. This change ensures that slots are generated more accurately, adhering to schema constraints.
Improved Program Flow and Naming Consistency
The program generation flow has been refined to use consistent patterns across functions, reducing confusion and allowing for smoother integration and future updates.
Performance & Reliability
Aggregation Methods: Simplified and Streamlined
The aggregation process has been consolidated into a single, more intuitive method. This simplification reduces errors and makes it easier for our logic model to handle complex grouping and aggregation tasks.
Chat Flow Parsing: Enhanced Accuracy
Parsing improvements in the chat flow ensure that only the most relevant tool call information is processed, reducing confusion and improving the reliability of ongoing conversations.
Bug Fixes
Resolved deprecation warnings from Pydantic, enhancing compatibility and future-proofing our codebase.
Addressed issues with slot filling, ensuring that slots are correctly managed as lists rather than dictionaries.
Fixed a bug that prevented some column functions from executing correctly, improving query reliability.
Developer Experience
Codebase Cleanup and Consistency
Our codebase has undergone significant cleanup, improving readability and maintainability. This includes standardizing naming conventions and refactoring redundant code segments for a cleaner development environment.
Dec 15, 2024
Comprehensive API Enhancements and Query System Optimizations
Feature Updates
End-to-End API Publishing and Customization
Streamline your API management with robust new features for creating, editing, and deleting APIs. Enhanced validation ensures reliable endpoint configurations, while optional hard delete functionality allows for permanent API removal when necessary. These updates simplify API lifecycle management and improve consistency across your projects.
Custom API Editor in Frontend
Introducing a customizable API editor interface for frontend users. This new tool enhances how you interact with API definitions directly from the UI, allowing for more intuitive API management and quicker updates without diving into the backend.
Generalize and Create Query Program Endpoints
New endpoints support the generalization and creation of query programs, enabling more flexible query definitions and execution. This enhancement facilitates dynamic query management, allowing users to adapt and extend their querying capabilities quickly.
Enhancements & UX Improvements
Improved Expression Slots in Query System
Experience a smoother querying process with updates to expression slots. This comprehensive overhaul enhances syntax handling and allows for more complex query constructions, improving the overall efficiency and flexibility of data queries.
API Endpoints Enhancement
Newly added API endpoints extend the functionality of your application, providing more structured and accessible data interaction methods. These changes support better integration and scalability of your services.
Bug Fixes
Resolved issues with column limits by reclassifying LimitsT as a more appropriate StrEnum, improving data model flexibility and stability.
Addressed problems with text-based query plans that hindered query clarity. This feature has been temporarily removed to enhance the overall query generation process.
These updates aim to streamline your workflow, enhance the usability of APIs, and ensure a more robust and flexible data management experience.
Dec 8, 2024
Enhanced Data Handling and Query System Improvements
Feature Updates
Default Values Integration in Query Programs
Introducing default values for slots in query programs enhances the robustness and reliability of code execution. This new feature ensures that all slots are pre-filled with sensible defaults, minimizing errors and improving execution consistency.
New API Endpoint for Query Programs
A new API endpoint now allows retrieval of APIs by queryprogram_id, streamlining access and integration. This update simplifies backend processes, making it easier to manage and interact with specific query programs.
Enhancements & UX Improvements
Improved Query Techniques and Documentation
The query system now boasts enhanced techniques, including improvements to the rank method and support for more complex operations like grouping. Documentation updates, including the relocation of 'technique' comments, enhance guidance and usability for developers working with the system.
Simplified Expression Handling
Significant changes to how expressions are serialized and deserialized in query programs now allow for more efficient processing and reduced ambiguity. This improvement aids in maintaining clarity and precision within the codebase.
Streamlined Preprocessor and Module Loading
Enhancements to the preprocessor ensure better string extraction with correct handling of quotes, while module loading in QueryProgram has been centralized to improve maintainability and performance.
Bug Fixes
Addressed issues with slot replacement in functions from loaded modules, ensuring accurate execution.
Corrected the data model by removing the unused 'dataset_name' field, resolving related errors.
Fixed preprocessor string handling to properly decode escape sequences and strip surrounding quotes, enhancing data processing reliability.
Developer Experience
Schema Generation Advancements
Refinements in schema generation now provide clearer data insights through the addition of 'limits' to data columns. These changes also optimize data sampling methods, improving the robustness and accuracy of data handling processes.
Query Library and Example Enhancements
New functions for event progression and conditional execution have been added to the query library. These enhancements improve the flexibility and functionality of query operations, offering developers more tools to handle complex data scenarios effectively.
This week's updates focus on reinforcing the core functionality of our query systems and enhancing the overall developer experience with more intuitive interfaces and robust data handling capabilities.
Dec 1, 2024
Expanded API Capabilities and Enhanced Query Processing
Feature Updates
Comprehensive API Management with CRUD and Dynamic Documentation
Manage your APIs more effectively with newly implemented CRUD operations for API models and endpoints. Enjoy seamless integration with dynamic OpenAPI specs and live API documentation, ensuring clarity and ease of use for developers.
Advanced Query Programs with Entity Linking and Result Planning
Enhance your query processing by linking queries to data models, allowing for improved recognition and processing. The new entity linking functionality and result plan prologue feature give queries a structured approach, making data insights more accessible.
Enhancements & UX Improvements
Structured Data Handling and Schema Refinement
Experience improved data alignment with the introduction of StructuredData
models, replacing the older schema object. This change enhances data integrity and supports better alignment with data models, streamlining data operations.
Flexible Query Expressions and Auto-Update Model Feature
The expr
method in QueryProgram now accepts additional keyword arguments, offering more flexibility. Additionally, all models now automatically update with changes in the Prisma schema, ensuring your data is always current and accurate.
Bug Fixes
Resolved issues with JSON compatibility in QueryProgram formats, ensuring smoother data exchanges.
Fixed data processing errors by converting NumPy arrays to Python lists, preventing potential crashes.
Corrected the calculation logic in end-to-end tests, enhancing the reliability of test results.
This update brings significant improvements to API management, query flexibility, and data handling. Developers can now leverage enhanced tools and features for more effective and efficient project management.
Nov 24, 2024
Enhanced Query Functionality and Structured Output Support
Feature Updates
Structured Output in LLM Responses
Our latest update enriches the LLM's capabilities by supporting structured outputs, allowing for more organized and meaningful responses. This enhancement introduces a new DataModel for improved data representation and clarity. Developers can now benefit from better error handling and structured data management, making interactions with the system more robust and reliable.
Comprehensive Query System Overhaul
Experience a revamped query system with a streamlined dataline package. This overhaul cleans up obsolete code and introduces a more intuitive approach to handling queries, ensuring a smoother user experience and more efficient data processing.
Enhancements & UX Improvements
Improved Query Examples and Annotations
We've refined our query examples to make them more illustrative and added better annotations for DataFrame methods. These improvements help users understand method returns and leverage new methods for determining DataFrame characteristics like emptiness, width, and height.
Query Preprocessing Enhancements
The QueryPreprocessor now intelligently replaces slots with descriptive labels, enhancing the readability and maintainability of queries. This change ensures that the query processing is not only accurate but also easily understandable by users.
Bug Fixes
Fixed incorrect query generation involving SUM, MIN, and MAX operations by improving documentation to guide better query formulations.
Resolved a bug in the DataFrame method implementation to ensure all necessary operations like the mean are supported.
Fixed message formatting issues in dataline prompts to ensure consistent and correct message delivery.
Nov 17, 2024
Enhanced Query Generation and Improved Deployment Flexibility
Feature Updates
Advanced Query Generation with New Endpoints
We've upgraded our system to leverage the OPENAI model for enhanced functionality. A new endpoint now facilitates asynchronous query generation for datasets and dataline models, significantly improving the efficiency and speed of data processing tasks.
Comprehensive CLI for Server Management
Introducing a robust command-line interface (CLI) for managing Infactory servers. This update enhances usability by allowing users to manage stacks instead of profiles, complete with CRUD operations for better stack management. Enjoy a new launcher UI for interactive stack configuration and deployment, along with improved stack management configurations for seamless user experience.
Enhancements & UX Improvements
Streamlined Kubernetes Deployment
Deployment to Kubernetes has never been easier. We've integrated code updates for seamless deployment to Kubernetes environments, including auto-applying schema changes upon container creation. This ensures smoother transitions and more efficient resource management across multi-platform environments.
Flexible Multi-Platform Environment Setup
With the introduction of Docker and Docker Compose in our staging setup, building and running multi-platform environments has become more straightforward. This change offers greater flexibility and aligns with modern deployment practices.
Bug Fixes
Resolved an issue where default JSON configurations were not being parsed correctly, ensuring data integrity and application stability.
Developer Experience
Improved Endpoint Management
Small but impactful improvements to our API endpoints now include project ID filtering and enhanced data lineage details. These enhancements provide developers with more granular control and insights into data handling, promoting more efficient data management practices.
Nov 10, 2024
Streamlined Data Management and Schema Enhancements
Feature Updates
Enhanced Data Line and Organization Management
Introducing the capability to add project_id
to data lines, allowing for more precise project tracking. Additionally, the clerk_org_id
is now optional when creating an organization, providing greater flexibility in organizational management. This update streamlines the process of associating data lines with specific projects, benefiting project managers and data analysts.
Refined Project and Organization Schemas
We've cleaned up project and organization schemas to improve data handling and integrity. This includes adding a deleted_at
field to the patch endpoint and a filter for the get
endpoint, which makes it easier to manage and query archived data. This refinement enhances the clarity and usability of project data for administrators and developers.
Enhancements & UX Improvements
Consistent Endpoint API
Our API has been refined for consistency, ensuring that argument processing is now aligned with input expectations. This makes API integration smoother and reduces potential errors in data handling, which is particularly beneficial for developers working with our tools.
Improved Dataline Examples
New example data line files have been added to ensure accurate generation and fix previous issues with GeneralColumnT's. This update helps developers and data scientists to better understand and utilize data line formats, leading to improved data processing workflows.
Bug Fixes
Resolved a regression issue where the 'ask dataline' feature was not emitting the correct number of outputs. This fix ensures that the feature delivers reliable results, supporting data analysts in obtaining accurate data insights.
Nov 3, 2024
Enhanced Data Handling and Query Optimizations
Feature Updates
Rank Query Step: Flexible Data Ranking
Introducing a new Rank Query Step that allows for customizable data ranking. You can now compute rankings on specified columns with optional grouping, making it easier to derive insights from complex datasets. This update is particularly beneficial for users who need to analyze data trends and patterns efficiently.
Improved Dataline Generation with Comprehensive Examples
We've enhanced our dataline generation capabilities by replacing the billionaire dataset with a more comprehensive USA cities dataset. This change offers a broader coverage of supported data types, providing a more robust example for leveraging LLMs in data generation tasks.
Enhancements & UX Improvements
Migration to AWS S3: Streamlined Data Management
Say goodbye to Git LFS dependencies! We have transitioned to AWS S3 for cloud storage, enabling seamless separation of code and data. This improvement facilitates better data management and accessibility, especially in Kubernetes environments.
Query Calculations: Better Performance and Accuracy
We've revamped our query calculations by removing Dask dependencies and exclusively using Pandas for DataFrame operations. This shift enhances both the performance and reliability of data processing tasks. Additionally, improvements to our merge and aggregation algorithms ensure faster and more accurate query results.
Developer Experience
Refactored Tool Endpoints for Greater Flexibility
Our tool endpoints have undergone a significant refactor, improving their implementation and testing processes. This update provides developers with a more streamlined workflow for testing and deploying tool APIs, enhancing overall productivity and code clarity.
Oct 27, 2024
Streamlined Query Systems and Enhanced API Management
Feature Updates
Expanded API Endpoints for Better Tool Management
Our API has been expanded to include new routes for managing tools, complete with improved error handling and versioning. Users can now benefit from detailed API documentation and a new logo endpoint to enhance brand visibility.
Methodical Query System Overhaul
The query system has been refined with the introduction of new QuerySteps like Merge, Percentage, Pick, and DIFF. These additions facilitate more complex queries and improve the handling of sub-results within your data analysis workflows. The system now also supports intuitive merging of DataFrame columns, enhancing query accuracy and efficiency.
Enhancements & UX Improvements
Dataset Collection and Management
We've pruned and expanded our dataset offerings with updates to the usa_covid_vaccinations.csv and the addition of the baby_names.csv dataset, making data management more comprehensive and easier to navigate.
Improved Query Syntax
Significant improvements in query syntax have reduced the character count needed for expressions by over 30%, enabling faster query formulation. The use of operator overloading and streamlined class naming conventions make the query process more intuitive and efficient.
Bug Fixes
Addressed a critical bug in the ensure_dask_df call, improving the reliability of data processing functions.
Fixed an issue where datasets without an existing ID column were causing failures. A new mechanism ensures unique key generation, preventing disruptions in data transformations.
Developer Experience
Docker and Deployment Enhancements
The Docker Compose files have been updated to align with Kubernetes staging deployments. This update allows seamless backend and frontend service operations via Docker Compose, setting the stage for future Kubernetes integrations.
Enhanced Testing Framework
Introduced a robust testing framework that includes accuracy tests and comprehensive end-to-end tests across multiple datasets. This ensures the reliability and correctness of query outputs, providing confidence in system operations.
By focusing on these improvements, we aim to deliver a more intuitive and powerful data management experience while ensuring stability and reliability across our systems.
Oct 20, 2024
Enhanced Query System and Streamlined Data Management
Feature Updates
Unified Query Aggregation
All aggregation functions such as average, count, max, min, and sum are now streamlined through a single AggregateColumnsStep. This update simplifies query management and enhances consistency across data processing tasks.
Improved Query Grouping Capabilities
Leverage advanced expression grouping with the new CalculateRowStep, which introduces sophisticated date-time calculations like truncating to hours and minutes, offering more granular data insights.
Expanded Dataset Collection
Introducing a refined dataset lineup, including faang_stocks.csv and nyc_taxi_rides.csv, enhancing your data exploration and analysis capabilities. Obsolete datasets have been removed, ensuring a cleaner, more relevant data environment.
Enhancements & UX Improvements
Workspace Experience Enhancements
Enjoy a smoother command entry and history functionality within your workspace, improving the overall user experience and efficiency during data interactions.
Diverse Workspace Example Updates
Example prompts have been updated to include a wider variety of random workspace names, enriching the diversity and relevance of your workspace scenarios.
Performance & Reliability
Transition to Dask for Query Processing
The query system now utilizes Dask, optimizing performance for large data operations and aligning with best practices for distributed computing.
Codebase Clean-up and Optimization
Significant code clean-up efforts have been made to remove unnecessary components and optimize Dask client usage, aligning more closely with Dask's architecture for better performance and reliability.
Bug Fixes
Resolved bool transformation issues within the transform_data tool, improving data processing accuracy.
Removed the pyicu dependency, transitioning to Pendulum for date management, ensuring more reliable date handling and reducing dependency overhead.
These updates collectively enhance the system's efficiency, usability, and performance, providing users with a more robust and intuitive data management experience.
Oct 13, 2024
Streamlined Deployment and Enhanced Code Organization
Feature Updates
Local Deployment Services: Simplified Setup with Docker Compose
Introducing a robust local deployment solution with the addition of docker-compose.yml
and docker-compose-client.yml
files. This setup allows for seamless local environment configuration, complete with integrated services like pgAdmin and PostgreSQL, making it easier for developers to manage and deploy applications locally.
Enhancements & UX Improvements
Dev Repository Cleanup: Improved Structure and Clarity
We have reorganized our development repository to enhance code clarity and maintainability. Key improvements include:
A refined folder structure for better navigation.
Transitioning to Poetry for dependency management over pip, streamlining package handling.
Enhanced build and deployment scripts for more efficient workflows.
Obsolete File Removal: Cleaner Code Base
Legacy and obsolete files have been pruned from the codebase, resulting in a cleaner and more efficient environment. This cleanup includes the relocation of set-clang-alternatives.py
to a more appropriate directory, ensuring better script organization.
Bug Fixes
Endpoint Class Hierarchy Correction
Addressed a critical issue with the endpoint class hierarchy that affected data processing. The fix ensures that data flows correctly through the system, maintaining integrity and reliability across services.
These updates focus on improving the development experience by simplifying deployment processes, enhancing code organization, and ensuring the stability of key application components.
Oct 6, 2024
Enhanced Infrastructure and Improved Documentation Clarity
Feature Updates
Dataline Dynamic Loading Enhancements
We have restructured the server infrastructure to enable dynamic loading capabilities for Dataline, paving the way for more efficient workshop development. This update will facilitate faster deployment and greater flexibility in managing data processing tasks, benefiting developers and system architects who rely on seamless data integration.
Bug Fixes
Spark Version Upgrade and Documentation Clarity
We've upgraded Spark to the latest versions, ensuring compatibility and improved performance. Additionally, the README has been clarified to provide better guidance, and we've removed hard-coded paths for temporary folders within the Docker setup. These changes enhance the ease of setup and ensure a smoother developer experience, especially for those setting up development environments or troubleshooting configurations.