Gigasheet Enterprise: Managed Mode

Modified on Mon, 7 Oct at 5:45 PM

Gigasheet Enterprise offers two core operational modes to access data from your data sources: Managed Mode and Live Query Mode. Each mode has distinct use cases and benefits depending on your organization’s data needs. This guide focuses on Managed Mode, providing a comprehensive understanding of its configuration, benefits, trade-offs, and best practices for enterprise data management.


What is Managed Mode?

In Managed Mode, Gigasheet replicates a copy of your data from supported sources, including data warehouses and databases, into its backend storage for analysis. This setup enables scheduled batch refreshes, ensuring your data is periodically synced into Gigasheet. Users then interact with the replicated data to perform ad-hoc exploration, analysis, and reporting without affecting the live data source.


Advantages of Using Managed Mode

  1. Enhanced Performance for Large-Scale Analytics: Since data is hosted within Gigasheet’s infrastructure, users can perform complex operations quickly without putting load on the original data source. This makes it suitable for exploring large datasets without impacting the performance of your data warehouse or database.

  2. Ephemeral Environment for Ad-hoc Analysis: Managed Mode provides a sandbox-like environment where analysts can freely explore and manipulate data without affecting the source data. This encourages experimentation and allows for quick insights without the risk of data corruption or alteration.

  3. Controlled Data Refreshes: The ability to set up scheduled batch refreshes ensures that users work with reasonably current data without having to continuously connect to the source system. This is particularly useful for datasets that are updated on a regular but non-real-time basis.

  4. Data Integrity and Governance: Since only a controlled copy of the data is replicated, governance policies and access controls are simplified. You maintain the integrity of the original data in your source systems, while users can explore the replicated data in Gigasheet without unintended consequences.

  5. Optional Data Write-Back: For workflows requiring data updates or adjustments, Managed Mode supports an optional write-back feature, allowing processed or enriched data to be sent back to the source when needed.

Potential Drawbacks of Managed Mode

  1. Data Duplication and Storage Management: Replicating data into Gigasheet introduces storage redundancy. While this can improve analysis performance, it does require managing an additional layer, potentially adding complexity, especially with large or rapidly growing datasets.

  2. Potential Data Lag: Scheduled data refreshes introduce latency between updates in the source data and the replicated copy in Gigasheet. If your use case requires real-time data, we strongly recommend Live Query mode.

  3. Initial Replication Overhead: The initial data transfer into Gigasheet can be lengthy, particularly for large datasets. Consider factors such as data size, network capacity, and table complexity during setup.

Best Practices for Enterprise Data Management in Managed Mode

  1. Selective Data Replication: Choose the data subsets most relevant for your analysis. Rather than replicating entire tables or datasets, replicate only the necessary columns, rows, or segments to reduce storage overhead and improve performance.

  2. Align Refresh Schedules with Data Needs: Establish a refresh schedule based on how often your source data changes and how fresh you need the data in Gigasheet to be. For example, if your source data is updated weekly, a weekly refresh schedule in Gigasheet ensures you have up-to-date data without excessive syncing.

  3. Monitor Storage and Data Usage Regularly: Regularly review storage usage and data consumption in Gigasheet. Identify and remove obsolete or low-value datasets to keep your storage costs manageable and improve the performance of your data analysis.

  4. Validate Data Before Write-Back: If using the optional write-back functionality, establish a process for validating data transformations before they are written back to the source system. This ensures data consistency and prevents erroneous updates in your core databases or data warehouses.

  5. Implement Data Lifecycle and Archival Policies: Establish policies for data lifecycle management to archive older or less frequently accessed data. This helps optimize the replicated data you actively use in Gigasheet, keeping performance high while adhering to storage best practices.

Use Cases for Managed Mode

Managed Mode is best suited for scenarios where data exploration, analysis, and insights are required on regularly updated datasets without needing real-time access – and when you don't want to tune your warehouse for optimal analytics performance. Example use cases include:

  • Business Operations and Reporting: Analyzing sales, finance, or operational data on a scheduled basis for monthly reports or trend analysis.
  • Marketing Campaign Analysis: Replicating marketing performance data to explore results over different periods and across various channels, without burdening the source system.
  • Product and Customer Data Exploration: Performing exploratory data analysis on product metrics, customer segmentation, or user behaviors based on periodically refreshed datasets.

Conclusion

Managed Mode in Gigasheet Enterprise is a powerful approach for replicating, managing, and analyzing large datasets from a variety of data sources while optimizing performance and maintaining data integrity. By following best practices for data selection, refresh schedules, and governance, organizations can effectively leverage Gigasheet for more efficient and flexible data analysis workflows.

Be sure to check out Live Query Mode, and see how Gigasheet can support real-time querying directly against your data sources without persisting any data outside your cloud environment.

Was this article helpful?

That’s Great!

Thank you for your feedback

Sorry! We couldn't be helpful

Thank you for your feedback

Let us know how can we improve this article!

Select at least one of the reasons
CAPTCHA verification is required.

Feedback sent

We appreciate your effort and will try to fix the article