Azure Storage: Blob

  • Author: Ronald Fung

  • Creation Date: 2 June 2023

  • Next Modified Date: 2 June 2024


A. Introduction

Azure Blob Storage is Microsoft’s object storage solution for the cloud. Blob Storage is optimized for storing massive amounts of unstructured data. Unstructured data is data that doesn’t adhere to a particular data model or definition, such as text or binary data.

Blob Storage is designed for:

  • Serving images or documents directly to a browser.

  • Storing files for distributed access.

  • Streaming video and audio.

  • Writing to log files.

  • Storing data for backup and restore, disaster recovery, and archiving.

  • Storing data for analysis by an on-premises or Azure-hosted service.

Users or client applications can access objects in Blob Storage via HTTP/HTTPS, from anywhere in the world. Objects in Blob Storage are accessible via the Azure Storage REST API, Azure PowerShell, Azure CLI, or an Azure Storage client library. Client libraries are available for different languages, including:

Clients can also securely connect to Blob Storage by using SSH File Transfer Protocol (SFTP) and mount Blob Storage containers by using the Network File System (NFS) 3.0 protocol.

About Azure Data Lake Storage Gen2

Blob Storage supports Azure Data Lake Storage Gen2, Microsoft’s enterprise big data analytics solution for the cloud. Azure Data Lake Storage Gen2 offers a hierarchical file system as well as the advantages of Blob Storage, including:

  • Low-cost, tiered storage

  • High availability

  • Strong consistency

  • Disaster recovery capabilities

For more information about Data Lake Storage Gen2, see Introduction to Azure Data Lake Storage Gen2.


B. How is it used at Seagen

As a biopharma research company using Microsoft Azure, you can use Azure Blob Storage to store and manage unstructured data in the cloud. Here are some ways you can use Azure Blob Storage:

  1. Data storage: Azure Blob Storage can be used to store large volumes of unstructured data related to your biopharma research efforts, such as images, videos, audio files, and documents.

  2. Data sharing: Azure Blob Storage allows you to securely share data with other researchers and stakeholders, helping to facilitate collaboration and knowledge sharing.

  3. Data backup and recovery: Azure Blob Storage can be used for data backup and recovery, providing a reliable and scalable solution for protecting your research data.

  4. Data archiving: Azure Blob Storage allows you to archive data that is no longer actively used, helping to reduce storage costs and improve data management.

  5. Security: Azure Blob Storage provides advanced security features, helping to protect the privacy and security of your research data.

  6. Scalability: Azure Blob Storage is highly scalable, allowing you to easily scale up or down as your data needs change.

  7. Cost-effectiveness: Azure Blob Storage provides a cost-effective solution for storing and managing large volumes of unstructured data, allowing you to pay only for the resources you use.

  8. Availability: Azure Blob Storage provides high availability and disaster recovery capabilities, ensuring that your research data is always available and protected.

Overall, by leveraging Azure Blob Storage, you can effectively store and manage your unstructured research data in the cloud, and collaborate with other researchers and stakeholders to achieve your biopharma research goals. By using Azure Blob Storage for data storage, data sharing, data backup and recovery, data archiving, security, scalability, cost-effectiveness, and availability, you can effectively manage your research data and make data-driven decisions that support your biopharma research efforts.


C. Features

Azure Blob Storage is a cloud-based storage service that provides scalable and cost-effective solutions for storing and managing large volumes of unstructured data. Here are some of the key features of Azure Blob Storage:

  1. Data storage: Azure Blob Storage provides a scalable and cost-effective solution for storing large volumes of unstructured data related to your biopharma research efforts, such as images, videos, audio files, and documents.

  2. Data sharing: Azure Blob Storage allows you to securely share data with other researchers and stakeholders, helping to facilitate collaboration and knowledge sharing.

  3. Data backup and recovery: Azure Blob Storage can be used for data backup and recovery, providing a reliable and scalable solution for protecting your research data.

  4. Data archiving: Azure Blob Storage allows you to archive data that is no longer actively used, helping to reduce storage costs and improve data management.

  5. Security: Azure Blob Storage provides advanced security features, helping to protect the privacy and security of your research data.

  6. Scalability: Azure Blob Storage is highly scalable, allowing you to easily scale up or down as your data needs change.

  7. Durability: Azure Blob Storage provides high durability, ensuring that your research data is always available and protected.

  8. Performance: Azure Blob Storage provides high performance data processing capabilities, allowing you to quickly and efficiently store and retrieve your research data.

  9. Integration: Azure Blob Storage can be integrated with other Azure services, allowing you to move data between systems and perform data integration tasks.

  10. Cost-effectiveness: Azure Blob Storage provides a cost-effective solution for storing and managing large volumes of unstructured data, allowing you to pay only for the resources you use.


D. Where Implemented

LeanIX


E. How it is tested

Testing Azure Blob Storage involves verifying that the service is properly configured and that it is effectively storing and managing your biopharma research data in the cloud. Here are some steps you can take to test Azure Blob Storage:

  1. Verify configuration: Verify that Azure Blob Storage is properly configured and integrated with your Azure account and applications and websites.

  2. Test data storage: Test Azure Blob Storage by storing and managing large volumes of unstructured data related to your biopharma research efforts, ensuring that the service is effectively storing and managing your research data.

  3. Test data sharing: Test Azure Blob Storage by securely sharing data with other researchers and stakeholders, helping to facilitate collaboration and knowledge sharing.

  4. Test data backup and recovery: Test Azure Blob Storage by using it for data backup and recovery, ensuring that the service provides a reliable and scalable solution for protecting your research data.

  5. Test data archiving: Test Azure Blob Storage by archiving data that is no longer actively used, ensuring that the service is effectively reducing storage costs and improving data management.

  6. Test security: Test Azure Blob Storage by verifying that the service provides advanced security features, helping to protect the privacy and security of your research data.

  7. Test scalability: Test Azure Blob Storage by verifying that the service is highly scalable, allowing you to easily scale up or down as your data needs change.

  8. Test durability: Test Azure Blob Storage by verifying that the service provides high durability, ensuring that your research data is always available and protected.

  9. Test performance: Test Azure Blob Storage by verifying that the service provides high performance data processing capabilities, allowing you to quickly and efficiently store and retrieve your research data.

  10. Test cost-effectiveness: Test Azure Blob Storage by verifying that the service provides a cost-effective solution for storing and managing large volumes of unstructured data, allowing you to pay only for the resources you use.

Overall, testing Azure Blob Storage involves verifying that the service is effectively storing and managing your research data in the cloud, and providing powerful data storage, data sharing, data backup and recovery, data archiving, security, scalability, durability, performance, and cost-effectiveness capabilities. By taking these steps, you can ensure that you are effectively using Azure Blob Storage to manage your research data and make data-driven decisions that support your biopharma research efforts.


F. 2023 Roadmap

????


G. 2024 Roadmap

????


H. Known Issues

Like any software or service, there may be known issues or limitations with Azure Blob Storage that users should be aware of. Here are some of the known issues with Azure Blob Storage:

  1. Cost: Azure Blob Storage can be expensive, particularly for organizations that require a high level of storage capacity or that require frequent data access.

  2. Complexity: Azure Blob Storage can be complex to configure and use, particularly for organizations with limited experience in cloud-based storage and data management.

  3. Integration: While Azure Blob Storage can be integrated with other Azure services, there may be compatibility issues that need to be addressed.

  4. Reliability: Azure Blob Storage may experience occasional outages or service disruptions, which can impact the storage and management of your research data.

  5. Performance: Azure Blob Storage may experience performance issues when working with large datasets or complex data management workflows.

  6. Security: While Azure Blob Storage provides advanced security features, there may be security risks associated with storing and managing research data in the cloud.

Overall, while Azure Blob Storage offers a powerful tool for storing and managing unstructured research data in the cloud, users must be aware of these known issues and take steps to mitigate their impact. This may include carefully managing costs to ensure that they stay within their budget, carefully configuring the service to meet the specific needs of their data management workflows, and carefully monitoring data storage and management activity to ensure that the service is effectively storing and managing research data. By taking these steps, users can ensure that they are effectively using Azure Blob Storage to manage their research data and make data-driven decisions that support their biopharma research efforts.


[x] Reviewed by Enterprise Architecture

[x] Reviewed by Application Development

[x] Reviewed by Data Architecture