aws – Study Tech Online

VM Storage Optimisation: Performance & Efficiency

Virtual machines thrive or falter on their storage. Too little I/O, and apps stall; too much over-provisioning, and you waste precious capacity. Today, we’ll unpack how to right-size, streamline, and supercharge VM storage—covering provisioning modes, caching, tiering, and cutting-edge NVMe-over-Fabrics.

1. Choosing the Right Provisioning Mode

Every hypervisor offers multiple disk formats and allocation strategies. Pick the one that balances speed, space efficiency, and manageability:

Thin Provisioning
- Allocates disk space on demand.
- Pros: Saves capacity; perfect for dev/test or unpredictable growth.
- Cons: Can suffer from fragmentation and sudden latency spikes under heavy writes.
Thick Provisioning
- Lazy-Zeroed: Reserves full size but zeros blocks on first write.
- Eager-Zeroed: Zeroes all blocks upfront.
- Pros: Predictable performance; no runtime zeroing penalty.
- Cons: Longer deployment times; consumes full capacity immediately.
Sparse vs. Fully-Allocated (Cloud Disks)

AWS EBS gp3/IO1, Azure Managed Disks, OCI Block Volumes: choose throughput-optimised tiers or burst-capable types.
Match your workload: random I/O needs “provisioned IOPS” tiers; sequential workloads can use HDD/throughput-focused disks.

2. Software-Defined & Hyperconverged Storage

Modern data centres layer intelligence atop raw disks:

VMware vSAN / Azure Stack HCI
- Pool local SSD/HDD into a distributed datastore.
- Inline dedupe/compression on capacity tiers; caching on flash tiers.
Storage Spaces Direct (Windows)
- Mirror-accelerated parity gives you cost-efficient resiliency plus SSD caching.
Ceph / GlusterFS (Open Source)
Scales with commodity hardware; uses erasure coding for high capacity efficiency.

Key tip: carve out a fast caching tier (NVMe or enterprise SSD) and a high-capacity tier (SATA HDD or QLC SSD). Let the system auto-move “hot” blocks onto flash.

3. Caching & Read/Write Acceleration

Host-Level Cache (Local SSD/NVMe)
- Assign a local device as read-cache or write-buffer for shared datastores.
- Thunderbolt/NVMe devices on ESXi or Hyper-V’s Host Cache for jump-start performance.
Guest-Based Cache (In-VM RAM/SSD)
- Tools like Intel Optane DC Persistent Memory act as an ultra-low-latency tier.
- Windows ReadyBoost or Linux’s bcache for specific VM acceleration.
Write Coalescing & De-dupe

Enable zero-copy snapshots (CBT on VMware) to minimise clone impact.
Leverage on-array deduplication for repeatable data patterns (virtual desktops, golden images).

4. Multipathing & Networked Storage

For SAN or NAS-backed VMs, resilience and throughput hinge on proper pathing:

Multipath I/O (MPIO)
- Use ALUA on Fibre Channel or iSCSI; configure round-robin or active/active policies.
- Ensure failover timeouts align with your RTO objectives.
Network Tuning for NFS/iSCSI
- Jumbo frames (MTU 9000) end-to-end to cut packet overhead.
- Separate management, vMotion/live-migration, and storage networks onto dedicated VLANs or VLAN-tagged NICs.
NVMe-over-Fabrics (NVMe-oF)

For extreme IOPS/low latency, expose remote NVMe targets over RDMA (RoCE) or TCP.
Requires RDMA-capable NICs and switch infrastructure; ideal for database VMs or AI/ML workloads.

5. Security & Data Protection

Encryption-At-Rest
- Hypervisor-native: VMware VM Encryption, Azure Disk Encryption, AWS EBS encryption.
- Manage keys via KMIP-compliant key vaults (vCenter KMS, Azure Key Vault, AWS KMS).
Snapshots vs. Backups
- Snapshots are instantaneous but not a substitute for backups—offload backups to object storage (Azure Blob, AWS S3, OCI Object Storage).
- Automate snapshot pruning and lifecycle via scripts or built-in policies to avoid runaway capacity consumption.
Replication & DR

Use vSphere Replication, Azure Site Recovery, AWS Elastic Disaster Recovery, or OCI DRG.
Test your failover runbooks quarterly and validate RPO/RTO under different load scenarios.

6. Putting It All Together

Start by profiling each VM’s I/O pattern: IOPS, throughput, read/write ratio.
Map VMs to storage tiers:
• Gold: Eager-zeroed, NVMe cache, dedicated multipathed FC or NVMe-oF.
• Silver: Lazy-zeroed thick on hybrid vSAN/Storage Spaces with flash caching.
• Bronze: Thin-provisioned HDD or budget cloud disk.
Automate reprovisioning when workloads change—use IaC templates (Terraform, ARM, CloudFormation) for consistency.
Continuously monitor with Prometheus, vRealize, Azure Monitor or CloudWatch dashboards to catch hot spots before users do.

What Is a Virtual Machine?

aws azure vmware windows

Virtual machines (VMs) are the building blocks of modern IT infrastructure—encapsulated “computers” that run on software rather than physical hardware. Think of a VM as a fully functional PC, complete with its own CPU, memory, storage, and network interfaces, but packaged as files and managed by a hypervisor or cloud control plane. Below, we’ll unpack how VMs work in four leading platforms: VMware, Azure, AWS, and Oracle Cloud.

VMware Virtual Machine

At its core, a VMware VM is a guest operating system running atop the ESXi hypervisor on a physical host. Key characteristics:

Isolation & Encapsulation: Each VM lives in its directory of configuration files (.vmx), virtual disks (.vmdk), and logs—fully sandboxed from its neighbours.
Advanced Services: vMotion for live migration, Distributed Resource Scheduler (DRS) for automated load balancing, and snapshots for instant rollbacks give datacenter operators granular control.
On-Premise Focus: Ideal for organisations with existing server investments, private clouds, or strict data-sovereignty requirements.

Azure Virtual Machine

Azure Virtual Machines bring that same VM concept into Microsoft’s public cloud:

Pre-Built Images: Choose from hundreds of Windows and Linux images (including SQL Server, Ubuntu, CentOS) in the Azure Marketplace.
Size & Scale: VM sizes range from tiny burstable instances for dev/test, all the way to GPU-accelerated, high-memory SKUs for AI and big data.
Integrated Services: Plug directly into Azure Virtual Network, Azure Backup, and Azure Monitor, while paying per-second for compute and disk usage.

AWS Virtual Machine (EC2 Instance)

Amazon EC2 instances are VMs on AWS, defined by the underlying Amazon Machine Image (AMI) and instance type:

AMI Foundation: Every VM boots from an AMI, which bundles your OS, application server, and custom settings. You can use AWS-provided AMIs or craft your own.
Instance Types: From tiny t-series burst-capable instances to memory-optimised R-series and compute-optimised C-series, there’s a right fit for every workload.
Elastic Architecture: Auto Scaling Groups let you automatically add or remove instances based on demand, ensuring performance and cost-efficiency.

Oracle Cloud Virtual Machine

Oracle Cloud Infrastructure (OCI) offers VMs as “compute instances” on shared or dedicated hosts:

Flexible vs. Predefined Shapes: Pick a one-size-fits-many shape, or tailor your vCPU and RAM exactly to your needs, then resize on the fly.
Enterprise-Grade Security: Built-in isolation, customer-controlled encryption keys, and SOC-compliant regions make OCI VMs a go-to for regulated industries.
Seamless Integration: Connect your VMs with Oracle’s Autonomous Database, Load Balancer, and high-performance block storage through a low-latency virtual cloud network.

Choosing the Right VM

Whether modernising an on-prem datacenter with VMware, extending the footprint into Azure or AWS, or tapping into Oracle’s enterprise pedigree, virtual machines give:

Rapid provisioning and teardown
Fine-grained resource control
Cost alignment with actual usage
The foundation for containers, serverless functions, and other cloud-native patterns

In the next post, we’ll dive into VM networking patterns and best practices for security and performance tuning. Stay tuned!