From Automation to Autonomy – Oracle Autonomous Database and Autonomous Linux Explained

From Automation to Autonomy – Oracle Autonomous Database and Autonomous Linux Explained

As I prepare to embark on a new role at Oracle in October 2024, diving deep into the company’s groundbreaking technologies has been both exciting and essential. In August 2024, I passed seven different Oracle Cloud Infrastructure (OCI) certifications, which helped massively to get a good overview. But I am still lacking information about the history and important advancements of Oracle and OCI over past years. Oracle CloudWorld 2024 came with a lot of great announcements and enhancements, but it is time for a structured approach to close the gaps. Yesterday, I listened to Larry Ellison’s keynote from OpenWorld 2019, which was mainly about the second-generation cloud and autonomous database.

A very good summary of this keynote can be found here.

In 2018, when I joined VMware and started this blog in 2018, the main idea was to share my learnings, knowledge and experience. That is exactly what I am going to do again. And it feels so good to a beginner again at something! 😉 Let’s dive in.

A Technological Revolution

In recent years, the digital landscape has been dramatically transformed by advancements in automation and artificial intelligence (AI). One significant development in this field is the advent of autonomous databases and autonomous operating systems. Oracle has been at the forefront of these innovations, with its Autonomous Database offering and Autonomous Linux OS. 

Autonomous Databases – An Overview

An autonomous database, at its core, is a database that uses machine learning and AI to automate the tedious and complex tasks typically performed by database administrators (DBAs). These tasks include tuning, patching, security management, backups, and system optimization. By eliminating manual processes, autonomous databases provide better security, reduce human error, improve performance, and lower operational costs.

Oracle’s journey toward an autonomous database began with the introduction of Oracle Database 10g in 2003, which introduced “Automatic Storage Management” (ASM) and other automated features. Oracle continued its efforts with subsequent versions, such as Oracle 11g, which featured automatic memory management and automatic diagnostic frameworks. These incremental advances laid the groundwork for full autonomy.

While these automation features improved efficiency, DBAs were still required for many crucial tasks like patch management, security configurations, and performance monitoring.

Machine Learning and AI

The true breakthrough toward autonomous databases came with the integration of machine learning and artificial intelligence. These technologies enabled databases to not only respond to changes in workload patterns but also predict potential issues and optimize system resources in real-time.

Oracle’s Vision – The Autonomous Database

Oracle took a bold step forward with the introduction of its Autonomous Database in 2018. It automates every phase of the database lifecycle, including:

  • Provisioning: The database is automatically created, configured, and tuned for optimal performance without requiring human intervention.
  • Performance Tuning: AI and machine learning algorithms continually monitor and optimize the database to ensure high performance.
  • Patching and Updating: Autonomous databases automatically apply security patches and system updates with zero downtime.
  • Backup and Recovery: Automated backups are performed regularly without any DBA involvement, and recovery is automated in the event of failure.
  • Scaling: The system scales automatically based on demand, ensuring optimal resource utilization and cost-effectiveness.

A list of all key features can be found here.

Choice

Autonomous Database cloud services offer two infrastructure choices:

With serverless (ADB-S), the simplest configuration, multiple customers share the resources of an Exadata cloud infrastructure. These customers can quickly get started with no minimum commitment, enjoying quick database provisioning and independent scalability of compute and storage. Serverless runs on Oracle Cloud Infrastructure.

With dedicated (ADB-D), the customer must first subscribe to a dedicated Exadata cloud infrastructure that is isolated from other tenants, with no shared processor, memory, network, or storage resources. This infrastructure choice offers greater control of the software and infrastructure lifecycle, customizable policies for separation of database workload, software update schedules and versioning, and availability policies. Dedicated infrastructure is available on Oracle Cloud Infrastructure and Exadata Cloud@Customer.

Autonomous Linux – Revolutionizing Operating System Management

Oracle introduced Autonomous Linux in 2019, marking the world’s first autonomous operating system. Just as the autonomous database eliminates the need for manual database administration, Autonomous Linux automates many of the routine and critical tasks associated with operating system management.

Oracle Autonomous Linux builds on Oracle Linux and adds machine learning-driven automation for system management. Some of the key features include:

  • Automated Patching and Updates: Oracle Autonomous Linux automatically applies updates, including security patches, without requiring downtime or rebooting. This continuous update process ensures that systems remain secure and up to date.
  • Self-Tuning: The operating system optimizes itself to ensure that resources are used efficiently, and that performance remains consistent.
  • Fault Detection and Resolution: By leveraging machine learning, Autonomous Linux can detect potential system faults before they become critical issues, reducing downtime and preventing system crashes.
  • Integrated with Oracle Cloud Infrastructure (OCI): Autonomous Linux is tightly integrated with Oracle Cloud Infrastructure, allowing for seamless deployment, monitoring, and scaling.

Ksplice – Zero-Downtime Kernel Updates

One of the standout features of Oracle Autonomous Linux is its use of Ksplice, a technology that enables zero-downtime kernel updates. Traditionally, applying kernel patches required rebooting the system, which could result in service interruptions. Ksplice eliminates this need, allowing kernel updates to be applied in real-time without affecting running applications. This is particularly valuable in high-availability environments where downtime is not an option.

The combination of Autonomous Linux and Oracle Cloud Infrastructure offers organizations a powerful, automated cloud platform that requires minimal hands-on management. Autonomous Linux is ideal for environments where operational efficiency, security, and uptime are critical, such as e-commerce platforms, financial services, healthcare systems, and government infrastructure.

Also here, by reducing the need for manual intervention, Autonomous Linux minimizes human error, improves security posture, and lowers operating costs. 

Enhancements to Oracle’s Autonomous Database Service and Autonomous Linux

Since their initial releases, Oracle’s Autonomous Database and Autonomous Linux have undergone significant enhancements, driven by advancements in machine learning, AI, cloud computing, and feedback from a growing user base.

Initially, Oracle’s Autonomous Database was offered in two main configurations: Autonomous Transaction Processing (ATP) and Autonomous Data Warehouse (ADW). While these were highly specialized, Oracle has since expanded its scope to support a broader range of workloads, such as:

Mixed Workload Capabilities: Initially, ATP was designed for OLTP (Online Transaction Processing) workloads, while ADW was optimized for analytics and data warehousing. Now, Oracle’s Autonomous Database supports mixed workloads, meaning organizations can run both transactional and analytical workloads within the same database. This flexibility is particularly useful for applications that require real-time analytics on transactional data, such as retail platforms or financial services.

Graph and Spatial Analytics: Oracle added native support for graph and spatial analytics, enabling more advanced data processing for IoT, GIS (Geographic Information Systems), and fraud detection applications. This enhancement allows businesses to derive deeper insights from data relationships, making the database more attractive for industries like logistics, smart cities, and social networking platforms.

JSON and NoSQL Support: To meet the needs of modern applications, Oracle introduced JSON and NoSQL data models within the Autonomous Database. This capability makes it easier for developers to build cloud-native, microservices-based applications using document stores, while still benefiting from the full automation and security features of Oracle’s relational database.

Multi-Cloud and Hybrid Cloud Flexibility

To address the growing demand for multi-cloud and hybrid cloud solutions, Oracle has introduced features that allow organizations to integrate their Autonomous Database across various environments.

Oracle Cloud@Customer: With Oracle Cloud@Customer, enterprises can deploy the Autonomous Database in their own data centers while maintaining the full automation and management benefits. This is ideal for organizations that need to keep certain workloads on-premises due to data sovereignty, security, or latency concerns.

Interoperability with Microsoft Azure, Google Cloud and AWS: Oracle and Microsoft established a partnership that allows seamless interoperability between Oracle Cloud and other hyperscalers. This enables customers to run multi-cloud architectures, where they can use Oracle’s Autonomous Database alongside other Azure/AWS/GCP services, such as analytics and AI tools.

Latest Version

The newest release is Oracle Database 23ai

Conclusion

Since their initial launches, both Oracle Autonomous Database and Autonomous Linux have seen significant advancements. Oracle has continually expanded their capabilities to meet the demands of modern enterprises, from increasing the flexibility of workload support in the Autonomous Database to enhancing security and performance in Autonomous Linux.

Oracle CloudWorld 2024 Keynote Summaries

Oracle CloudWorld 2024 Keynote Summaries

The keynotes from day 1 Oracle CloudWorld 2024, held on September 10, spotlighted several significant themes centered around AI, cloud innovation, and partnerships. Oracle CEO Safra Catz opened the event, emphasizing how businesses are solving complex challenges using Oracle’s cloud and AI technologies. Catz introduced high-profile partners and customers, including MGM Resorts, Cloudflare, BNP Paribas, and even the CIA, each sharing insights on how Oracle’s technology is transforming their operations.

OCW 2024 Safra Catz CIA

A major announcement involved Oracle’s expanding partnerships, including a collaboration with Amazon Web Services (AWS) to offer Oracle Database@AWS, which simplifies multi-cloud operations. Catz highlighted Oracle’s vision of creating a seamless experience across different cloud providers, helping businesses improve security, efficiency, and customer satisfaction.

Safra Catz’s Opening Keynote

Oracle CEO Safra Catz started the event by emphasizing Oracle’s commitment to assisting businesses in succeeding in an increasingly digital and AI-driven world. She pointed out how companies are using Oracle Cloud Infrastructure (OCI) and Autonomous Database to enhance operational efficiency, improve customer experiences, and control costs. She consistently emphasized the importance of partnerships and multi-cloud strategies, which are now essential as businesses navigate complex cloud environments.

Key announcements:

  • Oracle’s expanded partnerships with AWS and Google Cloud. These collaborations allow businesses to run Oracle Database services, including Oracle Autonomous Database, across these platforms. This opens up Oracle’s cloud technologies to a wider audience, enhancing cross-cloud operability​.
  • Oracle Database@AWS: A new service enabling businesses to access Oracle’s Autonomous Database on Amazon Web Services with unified support, billing, and administration across platforms​.

With Oracle Database@Google Cloud, customers will for the first time get direct access to Oracle Database services running on OCI and deployed in Google Cloud datacenters. They can now take advantage of Oracle’s industry-leading database and Exadata technology to accelerate innovation and develop new applications. In addition, customers can run applications on Oracle Linux, which is now supported by Oracle on Google Cloud. Oracle Linux images can be imported using Google Cloud’s virtual disk image import process. Within the next 12 months, customers are also expected to be able to streamline Oracle Linux image provisioning in Google Compute Engine with ready-to-use images.

Larry Ellison’s Keynote on Oracle’s Vision and Strategy

Co-founder and CTO Larry Ellison’s keynote reinforced Oracle’s vision for the future of cloud computing, focusing on how AI and automation will reshape industries. Ellison discussed Oracle’s strategy of integrating AI deeply into its cloud offerings, particularly around data analytics, business processes, and decision-making automation.

OCW 2024 Larry Multi-Cloud

He highlighted:

  • Oracle’s AI-driven Fusion Applications and Autonomous Database, which help businesses automate repetitive tasks, reduce errors, and improve decision-making speed.
  • The importance of multi-cloud capabilities, stressing Oracle’s commitment to building seamless multi-cloud experiences, particularly with AWS and Google Cloud, as demonstrated by Oracle Database@AWS and Oracle Database@Google Cloud.

OCW 2024 Oracle Database at AWS

We are seeing huge demand from customers that want to use multiple clouds. To meet this demand and give customers the choice and flexibility they want, Amazon and Oracle are seamlessly connecting AWS services with the very latest Oracle Database technology, including the Oracle Autonomous Database. With Oracle Cloud Infrastructure deployed inside of AWS datacenters, we can provide customers with the best possible database and network performance.

OCW 2024 Oracle Cloud Data Centers

Ellison also mentioned Oracle’s commitment to autonomous security, showcasing how Oracle is leveraging AI to combat cybersecurity threats such as ransomware and identity theft. This next-gen technology aims to autonomously detect and mitigate cyber risks across networks, reflecting Oracle’s push toward more secure and efficient cloud environments.

Juan Loaiza’s Keynote on Generative Data Dev and App Dev

Juan Loaiza, Oracle’s Executive Vice President of Mission-Critical Database Technologies, introduced the concept of Generative Data Development (GenDev), a transformative approach to application and data development powered by AI. This new development paradigm allows enterprise applications to be generated automatically, focusing on both data and application logic without the need for manual coding.

Juan Loaiza Keynote on Generative Data Dev and App Dev_Oracle CloudWorld 2024

Here are the key highlights:

  • AI-Centric Development: The introduction of Oracle Database 23ai, the engine behind this shift, enables developers to harness the power of AI to generate modular and evolvable enterprise applications. This process automates the handling of complex data, ensuring that integrity, validation, and confidentiality are built into the core of the application.
  • Generative Development Infrastructure: GenDev provides a framework where developers can interact with applications and databases using natural language, simplifying the process of building, querying, and managing enterprise applications. This generative approach streamlines app development, making it faster and more intuitive.
  • Autonomous Database Integration: Oracle Database 23ai is fully integrated with Autonomous Database, featuring enhancements like retrieval-augmented generation (RAG), AI Vector Search, and built-in support for multiple large language models (LLMs) like Google Gemini and Anthropic Claude. This integration helps avoid issues like AI hallucinations and allows developers to retrieve precise, enterprise-grade data.
  • GPU and AI Model Support: By supporting NVIDIA GPUs, the Autonomous Database accelerates resource-intensive operations, especially those related to AI and deep learning models, making it easier for developers to implement advanced AI features without managing complex infrastructure.
  • Low-Code and No-Code Tools: Oracle also introduced a suite of tools like Data Studio AI and Graph Studio to enable non-developers to interact with data and build applications through visual interfaces. This further lowers the barrier to entry for businesses to leverage AI in their operations.

OCW 2024 Low-code APEX

T.K. Anand’s Keynote on Oracle Data Intelligence

T.K. Anand, Executive Vice President of Oracle Analytics, delivered a keynote focused on how Oracle is transforming data management and analytics with AI-powered solutions. One of the key announcements was the introduction of the Oracle Intelligent Data Lake, designed to simplify data management and eliminate data silos.

OCW2024 Oracle Intelligent Data Lake

This new solution integrates structured and unstructured data sources, allowing organizations to gain a more complete business view. The Data Lake also leverages Apache Spark and Apache Flink for real-time data processing and includes AI-driven features that streamline workflows.

Note: GA is expected next year in 2025

Anand emphasized how Oracle’s Data Intelligence Platform combines advanced analytics, data orchestration, and governance to deliver AI-powered insights at scale. This platform allows businesses to reduce administrative workloads, improve operational efficiency, and make data-driven decisions faster.

A critical feature introduced was the Oracle Analytics Cloud AI Assistant, which uses large language models to offer natural language-driven analytics, helping users to interact more intuitively with their data.

The top image illustrates the standard approach to building, which requires both knowledge of the product and data sets. In contrast, the bottom image shows the AI-assisted method for modifying the dashboard.

Additionally, Oracle’s Autonomous Database innovations were highlighted, such as RAG support (retrieval-augmented generation) and integration with GPU processing, enhancing performance for AI-driven tasks.

Oracle CloudWorld Day 2

On September 11, three key executives – Steve Miranda, Clay Magouyrk, and Mike Sicilia – took the stage to discuss Oracle’s latest innovations, strategies, and vision for the future. Each session highlighted groundbreaking advancements in cloud technology, AI, and industry-specific solutions, with a strong focus on helping businesses leverage these technologies to improve operations and drive innovation.

Steve Miranda’s Keynote on Oracle Fusion Applications

Steve Miranda, Executive Vice President of Applications Development, delivered a keynote centered on how Oracle is embedding artificial intelligence into its Fusion Applications Suite. He emphasized how AI is transforming every facet of business operations—from HR and finance to supply chain and customer experience—making it easier for companies to achieve better outcomes without requiring specialized expertise.

A standout feature of Miranda’s presentation was Oracle’s approach to making AI accessible. Unlike other cloud competitors that are introducing price hikes for AI-powered features, Oracle is incorporating AI capabilities into its existing Fusion Cloud Applications at no additional cost. Miranda humorously dismissed the idea of separate pricing for AI features, likening it to the absurdity of charging extra for fundamental software upgrades. His message was clear: AI is a natural progression in software evolution and should be available to all users without extra costs.

OCW 2024 Steve Miranda Trusted AI Partner

Miranda showcased how AI-driven automation and analytics are now seamlessly integrated across Oracle Fusion Applications. He gave concrete examples of how this benefits users – such as automated invoice processing in finance, predictive maintenance in supply chains, and enhanced employee engagement in HR. Through these real-world examples, Miranda highlighted how businesses can reduce manual workloads, make more informed decisions, and enhance efficiency by tapping into Oracle’s AI innovations.

 

Clay Magouyrk’s Keynote on AI, Speed, Flexibility, and Security

Clay Magouyrk, Executive Vice President of Oracle Cloud Infrastructure (OCI), used his keynote to unveil Oracle’s advancements in cloud technology, focusing on the flexibility, speed, and security of Oracle Cloud Infrastructure. His talk positioned OCI as the backbone of a new era of cloud computing. The big announcement he introduced was Dedicated Region25 – now starting as small as 3 racks!

OCW 2024 Clay Dedicated Region25 3 Racks

A new OCI Dedicated Region configuration – Dedicated Region25 – will be available in a smaller, scalable size starting at only three racks and rapidly deployable within weeks. Dedicated Region25 has a 75 percent smaller launch footprint and simplified datacenter requirements, supports OCI’s 150+ AI and cloud services and allows a wider range of customers to gain the agility, economics, and scale of the public cloud in their own datacenters. The new configuration will be available in the next calendar year.

Clay emphasized that Oracle’s cloud strategy is all about reducing complexity while delivering high performance and security. He introduced several new offerings aimed at simplifying the cloud journey for businesses. One of the central themes of his presentation was Oracle’s commitment to multi-cloud environments. Acknowledging that most enterprises operate in hybrid and multi-cloud landscapes, Magouyrk showcased how OCI’s interoperability with other cloud platforms like Google Cloud allows customers to deploy applications across different environments seamlessly.

Mike Sicilia’s Keynote on Solving Industries’ Biggest Challenges with Applied AI

Mike Sicilia, Oracle’s Executive Vice President of Industries, provided a detailed look at how Oracle is addressing the unique challenges of various industries through its vertical solutions. His session focused on Oracle’s deep industry expertise and how the company is using cloud technology to drive digital transformation in sectors such as healthcare, financial services, construction, and government.

OCW 2024 Vertical Solutions

Sicilia explained that Oracle’s industry-specific solutions are not just about generic cloud applications but are tailored to meet the regulatory and operational complexities of each industry.

Other Announcements and News

Please find below additional information and announcements:

From Cloud-First to Cloud-Smart to Repatriation

From Cloud-First to Cloud-Smart to Repatriation

VMware Explore 2024 happened this week in Las Vegas. I think many people were curious about what Hock Tan, CEO of Broadcom, had to say during the general session. He delivered interesting statements and let everyone in the audience know that “the future of enterprise is private – private cloud, private AI, fueled by your own private data“. On social media, the following slide about “repatriation” made quite some noise:

VMware Explore 2024 Keynote Repatriation

The information on this slide came from Barcley’s CIO Survey in April 2024 and it says that 8 out of 10 CIOs today are planning to move workloads from the public cloud back to their on-premises data centers. It is interesting, and in some cases even funny, that other vendors in the hardware and virtualization business are chasing this ambulance now. Cloud migrations are dead, let us do reverse cloud migrations now. Hybrid cloud is dead, let us do hybrid multi-clouds now and provide workload mobility. My social media walls are full of such postings now. It seems Hock Tan presented the Holy Grail to the world.

Where is this change of mind from? Why did only 43% during COVID-19 plan a reverse cloud migration and now “suddenly” more than 80%?

I could tell you the story now about cloud-first not being cool anymore, that organizations started to follow a smarter cloud approach, and then concluded that cloud migrations are still not happening based on their expectations (e.g., costs and complexity). And that it is time now to bring workloads back on-premises. It is not that simple.

I looked at Barclay’s CIO survey and the chart (figure 20 in the survey) that served as a source for Hock Tan’s slide:

Barclays CIO Survey April 2024 Cloud RepatriationWe must be very careful with our interpretation of the results. Just because someone is “planning” a reverse cloud migration, does it mean they are executing? And if they execute such an exercise, is this going to be correctly reflected in a future survey?

And which are the workloads and services that are brought back to an enterprise’s data center? Are we talking about complete applications? Or is it more about load balancers, security appliances, databases and storage, and specific virtual machines? And if we understand the workloads, what are the real reasons to bring them back? Figure 22 of the survey shows “Workloads that Respondents Intend to Move Back to Private Cloud / On-Premise from Public Cloud”:

Barclays CIO Survey April 2024 Workload to migrate

Okay, we have a little bit more context now. Just because some workloads are potentially migrated back to private clouds, what does it mean for public cloud vs. private cloud spend? Question #11 of the survey “What percentage of your workloads and what percentage of your total IT spend are going towards the public cloud, and how have those evolved over time?” focuses on this matter.

Barclays CIO Survey April 2024 Percentage of Workloads and Spend My interpretation? Just because one slide or illustration talks about repatriation does not mean, that the entire world is just doing reverse migrations now. Cloud migrations and reverse cloud migrations can happen at the same time. You could bring one application or some databases back on-premises but decide to move all your virtual desktops to the public cloud in parallel. We could still bring workloads back to our data center and increase public cloud spend. 

Sounds like cloud-smart again, doesn’t it? Maybe I am an organization that realized that the applications A, B, C, and D shouldn’t run in Azure, AWS, Google, and Oracle anymore, but the applications W, X, Y, and Z are better suited for these hyperscalers.

What else?

I am writing about my views and my opinions here. There is more to share. During the pandemic, everything had to happen very quickly, and everyone suddenly had money to speed up migrations and application modernization projects. After that, I think it is a natural thing that everything was slowing down a bit after this difficult and exhausting phase.

Some of the IT teams are probably still documenting all their changes and new deployments on an internal wiki, and their bosses started to hire FinOps specialists to analyze their cloud spend. It is no shocking surprise to me that some of the financial goals haven’t been met and result in a reverse cloud migration a few years later.

But that is not all. Try to think about the past years. What else happened?

Yes, we almost forgot about Artificial Intelligence (AI) and Sovereign Clouds.

Before 2020, not many of us were thinking about sovereign clouds, data privacy, and AI.

Most enterprises are still hosting their data on-premises behind their own firewall. And some of this data is used to train or finetune models. We see (internal) chatbots popping up using Retrieval Augmented Generation (RAG), which delivers answers based on actual data and proprietary information.

Okay. What else? 

Yep, there is more. There are new technologies and offerings available that were not here before. We just covered AI and ML (machine learning) workloads that became a potential cost or compliance concern.

The concept of sovereign clouds has gained traction due to increasing concerns about data sovereignty and compliance with local regulations.

The adoption of hybrid and hybrid multi-cloud strategies has been a significant trend from 2020 to 2024. Think about VMware’s Cloud Foundation approach with Azure, Google, Oracle etc., AWS Outposts, Azure Stack, Oracle’s DRCC, or Nutanix’s.

Enterprises started to upskill and train their people to deliver their own Kubernetes platforms.

Edge computing has emerged as a crucial technology, particularly for industries like manufacturing, telecommunications, and healthcare, where real-time data processing is critical.

Conclusion

Reverse cloud migrations are happening for many different reasons like cost management, performance optimization, data security and compliance, automation and operations, or because of lock-in concerns.

Yes, (cloud) repatriation became prominent, but I think this is just a reflection of the maturing cloud market – and not an ambulance.

And no, it is not a better moment to position your hybrid multi-cloud solutions, unless you understand the services and workloads that need to be migrated from one cloud to another. Just because some CIOs plan to bring back some workloads on-premises, does it mean/imply that they will do it? What about the sunk cost fallacy?

Perhaps IT leaders are going to be more careful in the future and are trying to find other ways for potential cost savings and strategic benefits to achieve their business outcomes – and keep their workloads in the cloud versus repatriating them.

Businesses are adopting a more nuanced workload-centric strategy.

What’s your opinion?

Distributed Hybrid Infrastructure Offerings Are The New Multi-Cloud

Distributed Hybrid Infrastructure Offerings Are The New Multi-Cloud

Since VMware belongs to Broadcom, there was less focus and messaging on multi-cloud or supercloud architectures. Broadcom has drastically changed the available offerings and VMware Cloud Foundation is becoming the new vSphere. Additionally, we have seen big changes regarding the partnerships with hyperscalers (the Azures and AWSes of this world) and the VMware Cloud partners and providers. So, what happened to multi-cloud and how come that nobody (at Broadcom) talks about it anymore?

What is going on?

I do not know if it’s only me, but I do not see the term “multi-cloud” that often anymore. Do you? My LinkedIn feed is full of news about artificial intelligence (AI) and how Nvidia employees got rich. So, I have to admit that I lost track of hybrid clouds, multi-clouds, or hybrid multi-cloud architectures. 

Cloud-Inspired and Cloud-Native Private Clouds

It seems to me that the initial idea of multi-cloud has changed in the meantime and that private clouds are becoming platforms with features. Let me explain.

Organizations have built monolithic private clouds in their data centers for a long time. In software engineering, the word “monolithic” describes an application that consists of multiple components, which form something larger. To build data centers, we followed the same approach by using different components like compute, storage, and networking. And over time, IT teams started to think about automation and security, and the integration of different solutions from different vendors.

The VMware messaging was always pointing in the right direction: They want to provide a cloud operating system for any hardware and any cloud (by using VMware Cloud Foundation). On top of that, build abstraction layers and leverage a unified control plane (aka consistent automation and operations).

And I told all my customers since 2020 that they need to think like a cloud service provider, get rid of silos, implement new processes, and define a new operating model. That is VMware by Broadcom’s messaging today and this is where they and other vendors are headed: a platform with features that provide cloud services.

In other words, and this is my opinion, VMware Cloud Foundation is today a platform with different components like vSphere, vSAN, NSX, Aria, and so on. Tomorrow, it is still called VMware Cloud Foundation, a platform that includes compute, storage, networking, automation, operations, and other features. No more other product names, just capabilities, and services like IaaS, CaaS, DRaaS or DBaaS. You just choose the specs of the underlying hardware and networking, deploy your private clouds, and then start to build and consume your services.

Replace the name “VMware Cloud Foundation” in the last paragraph with AWS Outposts or Azure Stack. Do you see it now? Distributed unmanaged and managed hybrid cloud offerings with a (service) consumption interface on top.

That is the shift from monolithic data centers to cloud-native private clouds.

From Intercloud to Multi-Cloud

It is not the first time that I write about interclouds, that not many of us know. In 2012, there was this idea that different clouds and vendors need to be interoperable and agree on certain standards and protocols. Think about interconnected private and public clouds, which allow you to provide VM mobility or application portability. Can you see the picture in front of you? What is the difference today in 2024?

In 2023, I truly believed that VMware figured it out when they announced VMware Cloud on Equinix Metal (VMC-E). To me, VMC-E was different and special because of Equinix, who is capable of interconnecting different clouds, and at the same time could provide a baremetal-as-a-service (BMaaS) offering.

Workload Mobility and Application Portability

Almost 2 years ago, I started to write a book about this topic, because I wanted to figure out if workload mobility and application portability are things, that enterprises are really looking for. I interviewed many CIOs, CTOs, chief architects and engineers around the globe, and it became VERY clear: it seems nobody was changing anything to make app portability a design requirement.

Almost all of the people I have spoken to, told me, that a lot of things must happen that could trigger a cloud-exit and therefore they see this as a nice-to-have capability that helps them to move virtual machines or applications faster from one cloud to another.

VMware Workload Mobility

And I have also been told that a lift & shift approach is not providing any value to almost all of them.

But when I talked to developers and operations teams, the answers changed. Most of them did not know that a vendor could provide mobility or portability. Anyway, what has changed now?

Interconnected Multi-Clouds and Distributed Hybrid Clouds

I mentioned it already before. Some vendors have realized that they need to deliver a unified and integrated programmable platform with a control plane. Ideally, this control plane can be used on-premises, as a SaaS solution, or both. And according to Gartner, these are the leaders in this area (Magic Quadrant for Distributed Hybrid Infrastructure):

Gartner Magic-Quadrant-for-Distributed-Hybrid-Infrastructure

In my opinion, VMware and Nutanix are providing a hybrid multi-cloud approach.

AWS and Microsoft are providing hybrid cloud solutions. In Microsoft’s case, we see Azure Stack HCI, Azure Kubernetes Service (AKS incl. Hybrid AKS) and Azure Arc extending Microsoft’s Azure services to on-premises data centers and edge locations.

The only vendor, that currently offers true multi-cloud capabilities, is Oracle. Oracle has Dedicated Region Cloud@Customer (DRCC) and Roving Edge, but also partnerships with Microsoft and Google that allow customers to host Oracle databases in Azure and Google Cloud data centers. Both partnerships come with a cross-cloud interconnection.

That is one of the big differences and changes for me at the moment. Multi-cloud has become less about mobility or portability, a single global control plane, or the same Kubernetes distribution in all the clouds, but more about bringing different services from different cloud providers closer together.

This is the image I created for the VMC-E blog. Replace the words “AWS” and “Equinix” with “Oracle” and suddenly you have something that was not there before, an interconnected multi-cloud.

What’s Next?

Based on the conversations with my customers, it does not feel that public cloud migrations are happening faster than in 2020 or 2022 and we still see between 70 and 80% of the workloads hosted on-premises. While we see customers who are interested in a cloud-first approach, we see many following a hybrid multi-cloud and/or multi-cloud approach. It is still about putting the right applications in the right cloud based on the right decisions. This has not changed.

But the narrative of such conversations has changed. We will see more conversations about data residency, privacy, security, gravity, proximity, and regulatory requirements. Then there are sovereign clouds.

Lastly, enterprises are going to deploy new platforms for AI-based workloads. But that could still take a while.

Final Thoughts

As enterprises continue to navigate the above mentioned complexities, the need for flexible, scalable, and secure infrastructure solutions will only grow. There are a few compelling solutions that bridge the gap between traditional on-premises systems and modern cloud environments.

And since most enterprises are still hosting their workloads on-premises, they have to decide if they want to stretch the private cloud to the public cloud, or the other way around. Both options can co-exist, but would make it too big and too complex. What’s your conclusion?

VMware Explore 2023 US – Day 1 Announcements

VMware Explore 2023 US – Day 1 Announcements

VMware Explore 2023 US is currently happening in Las Vegas and I am onsite! Below you will find an overview of the information that was shared with us during the general session and solution keynotes.

Please be aware that this list is not complete but it should include all the major announcements including references and sources.

VMware Aria and VMware Tanzu

Starting this year, VMware Aria and VMware Tanzu form a single track at VMware Explore and VMware introduced the develop, operate, and optimize pillars (DOO) for Aria and Tanzu around April 2023.

VMware Tanzu DOO Framework

The following name changes and adjustments have been announced at VMware Explore US 2023:

  • The VMware Tanzu portfolio includes two new product categories (product family) called “Tanzu Application Platform” and “Tanzu Intelligence Services”.
  • Tanzu Application Platform includes the products Tanzu Application Platform (TAP) and Tanzu for Kubernetes Operations (TKO), and the new Tanzu Application Engine module.
  • Tanzu Intelligence Services – Aria Cost powered by CloudHealth, Aria Guardrails, Aria Insights, and Aria Migration will be rebranded as “Tanzu” and become part of this new Tanzu Intelligence Services category.
    • Tanzu Hub & Tanzu Graph
    • Tanzu CloudHealth
    • Tanzu Guardrails
    • Tanzu Insights (currently known as Aria Insights)
    • Tanzu Transformer (currently known as Aria Migration)
  • Aria Hub and Aria Graph are now called Tanzu Hub
  • VMware Cloud Packs are now called the VMware Cloud Editions (more information below)

Note: VMware expects to implement these changes latest by Q1 2024

The VMware Aria and Tanzu announcement and rebranding information can be found here.

Tanzu Mission Control

After the announcement that Tanzu Mission Control supports the lifecycle management of Amazon EKS clusters, VMware announced the expansion to provide lifecycle management capabilities of Microsoft AKS clusters now as well. 

Tanzu Application Engine (Private Beta)

VMware announced a new solution for the Tanzu Application Platform category.

VMware Tanzu for Kubernetes Operations is introducing Tanzu Application Engine, enhancing multi-cloud support with lifecycle management of Azure AKS clusters, and offering new Kubernetes FinOps (cluster cost) visibility. A new abstraction that includes workload placement, K8s runtime, data services, libraries, infra resources, with a set of policies and guardrails.

The Tanzu Application Engine announcement can be found here.

VMware RabbitMQ Managed Control Plane

I know a lot of customers who built an in-house RabbitMQ cloud service.

VMware just announced a beta program for a new VMware RabbitMQ Managed Control Plane which allows enterprises to seamlessly integrate RabbitMQ within their existing cloud environment, offering flexibility and control over data streaming processes.

What’s New with VMware Aria?

Other Aria announcements can be found here.

What’s New with VMware Aria Operations at VMware Explore

Next-Gen Public Cloud Management with VMware Aria Automation

VMware Cloud Editions

What has started with four different VMware Cloud Packs, is now known as “VMware Cloud Editions” with five different options:

VMware Cloud Editions

Here’s an overview of the different solutions/subscriptions included in each edition:

VMware Cloud Editions Connected Subscriptions

More VMware Cloud related announcements can be found here.

What’s New in vSphere 8 Update 2

As always, VMware is working on enhancing operational efficiency to make the life of an IT admin easier. And this gets better with the vSphere 8 U2 release.

In vSphere 8 Update 2, we are making significant improvements to several areas of maintenance to reduce and in some cases eliminate this need for downtime so vSphere administrators can make those important maintenance changes without having a large impact on the wider vSphere infrastructure consumers.

These enhancements include, reduced downtime upgrades for vCenter, automatic vCenter LVM snapshots before patching and updating, non-disruptive certificate management, and reliable network configuration recovery after a vCenter is restored from backup.

More information about the vSphere 8 Update 2 release can be found here.

What’s New in vSAN 8 Update 2

At VMware Explore 2022, VMware announced the new vSAN 8.0 release which included the new Express Storage Architecture (ESA), which even got better with the recent vSAN 8.0 Update 1 release.

VMware vSAN Max – Petabyte-Scale Disaggregated Storage

VMware vSAN Max, powered by vSAN Express Storage Architecture, is a new vSAN offering in the vSAN family delivering
petabyte-scale disaggregated storage for vSphere. With its new disaggregated storage deployment model, vSAN customers can scale storage elastically and independently from compute and deploy unified block, file, and partner-based object storage to maximize utilization and achieve lower TCO.

VMware vSAN Max

vSAN Max expands the use cases in which HCI can provide exceptional value. Disaggregation through vSAN Max provides flexibility to build infrastructure with the scale and efficiency required for non-linear scaling applications, such as storage-intensive databases, modern elastic applications with large datasets and more. Customers have a choice of deploying vSAN in a traditional model or a disaggregated model with vSAN Max, while still using a single control plane to manage both deployment options.

The vSAN Max announcement can be found here.

VMware Cloud on AWS

VMware announced a VMware Cloud on AWS Advanced subscription tier that will be available on i3en.metal and i4i.metal instance types only. This subscription will include advanced cloud management, networking and security features:

  • VMware NSX+ Services (NSX+ Intelligence, NDR capabilities, NSX Advanced Load Balancer)
  • vSAN Express Storage Architecture Support
  • VMware Aria Automation
  • VMware Aria Operations
  • VMware Aria Operations for Logs

Note: Existing deployments (existing SDDCs) will be entitled to these advanced cloud management, networking and security features over time

The VMware Cloud on AWS Advanced Subscription Tier FAQ can be found here

Introduction of VMware NSX+

Last year, VMware introduced Project Northstar as technology preview:

Project Northstar is a SaaS-based networking and security offering that will empower NSX customers with a set of on-demand multi-cloud networking and security services, end-to-end visibility, and controls. Customers will be able to use a centralized cloud console to gain instant access to networking and security services, such as network and security policy controls, Network Detection and Response (NDR), NSX Intelligence, Advanced Load Balancing (ALB), Web Application Firewall (WAF), and HCX. It will support both private cloud and VMware Cloud deployments running on public clouds and enable enterprises to build flexible network infrastructure that they can spin up and down in minutes.

This year, VMware announced the initial availability of the NSX+ service. VMware NSX+ is a fully managed cloud-based service offering that allows networking, security, and operations teams to consume and operate VMware NSX services from a single cloud console across private and public clouds.

NSX+ Architectural Diagram

The following services are available:

  • NSX+ Policy Management: Provides unified networking and security policy management across multiple clouds and on-premises data centers.
  • NSX+ Intelligence (Tech Preview only): Provides a big data reservoir and a system for network and security analytics for real-time traffic visibility into applications traffic all the way from basic traffic metrics to deep inspection of packets.
  • NSX+ NDR (Tech Preview only): Provides a scalable threat detection and response service offering for Security Operations Center (SoC) teams to triage real time security threats to their data center and cloud.

There are three different NSX+ and two NSX+ distributed firewall editions available:

  • NSX+ Standard. For organizations needing a basic set of NSX connectivity and security features for single location software-defined data center deployments.
  • NSX+ Advanced. For organizations needing advanced networking and security features that are applied to multiple sites. This edition also entitles customers to VMware NSX+ Advanced Load Balancer Cloud Services.
  • NSX+ Enterprise. For organizations needing all of the capability NSX has to offer. This edition also entitles customers to VMware NSX+ Advanced Load Balancer Cloud Services.
  • NSX+ Distributed Firewall. For organizations needing implement access controls for east-west traffic within the network (micro-segmentation) but not focused on Threat detection and prevention services.
  • NSX+ Distributed Firewall with Threat Prevention. For organizations needing access control and select Threat prevention features for east-west traffic within the network. 

An NSX+ feature overview can be found here.

Note: Currently, NSX+ only supports NSX on-premises deployments (NSX 4.1.1 or later) and VMware Cloud on AWS

VMware Cloud Foundation

VMware announced a few innovations for H2 2023, which includes the support for Distributed Service Engine (DSE aka Project Monterey), vSAN ESA support, and NSX+.

 

Generative AI – VMware Private AI Foundation with Nvidia

VMware and Nvidia’s CEOs announced VMware Private AI Foundation as the result of their longstanding partnership. 

Built on VMware Cloud Foundation, this integrated solution with Nvidia will enable enterprises to customize models and run generative AI applications, including intelligent chatbots, assistants, search, and summarization.

Bild

Anywhere Workspace Announcements

At VMware Explore 2022, VMware shared its vision for autonomous workspaces.

Autonomous workspace is a concept (not an individual product) that is our north star for the future of end-user computing. It means going beyond creating a unified workspace with basic automations, to analyzing huge amounts of data with AI and machine learning, to drive more advanced, context aware automations. This leads to a workspace that can be considered self-configuring, self-healing, and self-securing. 

VMware continued working on the realization of this vision and came up with a lot of announcements, which can be found here.

Other Announcements

Please find below some announcements that VMware shared with us during the SpringOne event or before and after the general session on August 22nd, 2023:

Momentum in the Cloud: Crafting Your Winning Strategy with VMware Cloud

Momentum in the Cloud: Crafting Your Winning Strategy with VMware Cloud

The time is right for VMware Cloud! In the rapidly evolving landscape of modern business, embracing the cloud has become essential for organizations seeking to stay competitive and agile. The allure of increased scalability, cost-efficiency, and flexibility has driven enterprises of all sizes to embark on cloud migration journeys. However, the road to a successful cloud adoption is often coming with challenges. Slow and failed migrations have given rise to what experts call the “cloud paradox,” where the very technology meant to accelerate progress ends up hindering it.

As businesses navigate through this paradox, finding the right strategy to harness the full potential of the cloud becomes paramount. One solution that has emerged as a beacon of hope in this complex landscape is VMware Cloud. With its multi-cloud approach, which is also known as supercloud, VMware Cloud provides organizations the ability to craft a winning strategy that capitalizes on momentum while minimizing the risks associated with cloud migrations.

The Experimental Phase is Over

Is it really though? The experimental phase was an exciting journey of discovery for organizations seeking the potential of multi-cloud environments. Companies have explored different cloud providers, tested a variety of cloud services, and experimented with workloads and applications in the cloud. It allowed them to understand the benefits and drawbacks of each cloud platform, assess performance, security and compliance aspects, and determine how well each cloud provider aligns with their unique business needs.

The Paradox of Cloud and Choice

With an abundance of cloud service providers, each offering distinct features and capabilities, decision-makers can find themselves overwhelmed with options. The quest to optimize workloads across multiple clouds can lead to unintended complexities, such as increased operational overhead, inconsistent management practices/tools, and potential vendor lock-in.

Furthermore, managing data and applications distributed across various cloud environments can create challenges related to security, compliance, and data sovereignty. The lack of standardized practices and tools in a multi-cloud setup can also hinder collaboration and agility, negating the very advantages that public cloud environments promise to deliver.

Multi-Cloud Complexity

(Public) Cloud computing is often preached for its cost-efficiency, enabling businesses to pay for resources on-demand and avoid capital expenditures on physical infrastructure. However, the cloud paradox reveals that organizations can inadvertently accumulate hidden costs, such as data egress fees, storage overage charges, and the cost of cloud management tools. Without careful planning and oversight, the cloud’s financial benefits might be offset by unexpected expenses.

Why Cloud Migrations are Slowing Down

Failed expectations. The first reasons my customers mention are cost and complexity.

While the cloud offers potential cost savings in the long run, the initial investment and perceived uncertainty in calculating the total cost of ownership can deter some organizations from moving forward with cloud migrations. Budget constraints and difficulties in accurately estimating and analyzing cloud expenses lead to a cautious approach to cloud adoption.

One significant factor impeding cloud migrations is the complexity of the process itself. Moving entire infrastructures, applications, and data to the cloud requires thorough planning, precise execution, and in-depth knowledge of cloud platforms and technologies. Many organizations lack the in-house expertise to handle such a massive undertaking, leading to delays and apprehensions about potential risks.

Other underestimated reasons are legacy systems and applications that have been in use for many years and are often deeply ingrained within an organization’s operations. Migrating these systems to the cloud may require extensive reconfiguration or complete redevelopment, making the migration process both time-consuming and resource-intensive.

Reverse Cloud Migrations

While I don’t advertise a case for repatriation, I would like to share the idea that companies should think about workload mobility, application portability, and repatriation upfront. You can infinitely optimize your cloud spend, but if cloud costs start to outpace your transformation plans or revenue growth, it is too late already.

Embracing a Smart Approach with VMware Cloud

To address the cloud paradox and maximize the potential of multi-cloud environments, VMware is embracing the cloud-smart approach. This approach is designed to empower organizations with a unified and consistent platform to manage and operate their applications across multiple clouds.

VMware Cloud-Smart

  • Single Cloud Operating Model: A single operating model that spans private and public clouds. This consistency simplifies cloud management, enabling seamless workload migration and minimizing the complexities associated with multiple cloud providers.
  • Flexible Cloud Choice: VMware allows organizations to choose the cloud provider that best suits their specific needs, whether it is a public cloud or a private cloud infrastructure. This freedom of choice ensures that businesses can leverage the unique advantages of each cloud while maintaining operational consistency.
  • Streamlined Application Management: A cloud-smart approach centralizes application management, making it easier to deploy, secure, and monitor applications across multi-cloud environments. This streamlines processes, enhances collaboration, and improves operational efficiency.
  • Enhanced Security and Compliance: By adopting VMware’s security solutions, businesses can implement consistent security policies across all clouds, ensuring data protection and compliance adherence regardless of the cloud provider.

Why VMware Cloud?

This year I realized that a lot of VMware customers came back to me because their cloud-first strategy did not work as expected. Costs exploded, migrations were failing, and their project timeline changed many times. Also, partners like Microsoft and AWS want to collaborate more with VMware, because the public cloud giants cannot deliver as expected.

Customers and public cloud providers did not see any value in lifting and shifting workloads from on-premises data centers to the public. Now the exact same people, companies and partners (AWS, Microsoft, Google, Oracle etc.) are back to ask for VMware their support, and solutions that can speed up cloud migrations while reducing risks.

This is why I am always suggesting a “lift and learn” approach, which removes pressure and reduces costs.

Organizations view the public cloud as a highly strategic platform for digital transformation. Gartner forecasted in April 2023 that Infrastructure-as-a-Service (IaaS) is going to experience the highest spending growth in 2023, followed by PaaS.

It is said that companies spend most of their money for compute, storage, and data services when using Google Cloud, AWS, and Microsoft Azure. Guess what, VMware Cloud is a perfect fit for IaaS-based workloads (instead of using AWS EC2, Google’s Compute Engine, and Azure Virtual machine instances)!

Who doesn’t like the idea of cost savings and faster cloud migrations?

Disaster Recovery and FinOps

When you migrate workloads to the cloud, you have to rethink your disaster recovery and ransomware recovery strategy. Have a look at VMware’s DRaaS (Disaster-Recovery-as-a-Service) offering which includes ransomware recovery capabilities as well. 

If you want to analyze and optimize your cloud spend, try out VMware Aria Cost powered by CloudHealth.

Final Words

VMware’s approach is not right for everyone, but it is a future-proof cloud strategy that enables organizations to adapt their cloud strategies as business needs to evolve. The cloud-smart approach offers a compelling solution, providing businesses with a unified, consistent, and flexible platform to succeed in multi-cloud environments. By embracing this approach, organizations can overcome the complexities of multi-cloud, unlock new possibilities, and set themselves on a path to cloud success.

And you still get the same access to the native public cloud services.