Drut Webinar Drut Software Platform Install

Show video

Good afternoon and welcome my name is Simon  McCormack I'm product management for Drut   Technologies and today we're going to  talk about the Drut software platform   the DSP as we call it internally. DSP is a really  interesting bit of software that you can use for   building private Cloud type infrastructure  and it's the base software we use for Druts   photonic Fabric and disaggregated solution. What's  interesting is you can run it as a standalone   entity so it doesn't always have to come with  the photonic fabric it can be a building block   towards that. In the webinar today I'm going to  basically show an installation of the DSP and   show how smoothly we can deploy this platform  and the different components are part of it,   it's kind of an interesting thing to show a  deployment but we want to show how the bar   to getting a private Cloud setup can be very  low with a DSP type environment and it really   help you towards building your own private Cloud  environments. Before I go into a demo I want to   do a few slides to tell you a little bit more  about some of the components of the DSP you   know because there's a certain amount of Drut  technology plus some open source technology so   if you're going down that direction for the open  source then this could be the platform for you.  

I'm working product management so I spend a  lot of time talking to customers about their   problems and trying to build products so we  meet the problems that these customers have.   So I wanted to start with a little bit about  about what's driving the problems that we see,   what's driving us building this software because  you'll see then you will know what the direction   we're going and why we go in that direction, if  you think about some of the problems that we're   going to describe. So I start off with this first  slide, what are customers actually want   from a private Cloud type deployment. They're  all looking and you know you can say some have   different problems they all have different  problems Etc I say they're all looking for   essentially a cloud-like experience so they want  their users application deployment teams to have   a simple interface to deploy the machines  that they're going to use for their service   we also think we can extend that kind of concept  to all layers in your organization so you think   about people are involved in your data centers  they'll be low level infrastructure people   people deploying Cloud platform in addition  to the application deployers and the the users   Etc. We think they all deserve kind of a cloud-like  experience something simple for them to be able to   deploy what they need without worrying about the  underlying infrastructure that's a lot of what was   driving people towards I think the public Cloud  because that gave essentially a simple interface   to get into it so that's a one-core driving Factor  behind this software is to try to give you that   better experience. There's a number of other things  customers want you know, the list is endless but   I've got a few here there a lot of people are  very frustrated when they're purchasing and they   feel they're locked into Technology Solutions so  more and more customers we're finding they want   more standards-based flexibility in the software  Stacks that they're deploying which is where open   source becomes interesting because we've seen  a lot of maturity in the open source software   throughout the industry in in the last few years  and I'm going to demonstrate some of that as I   deploy the software today obviously they still  have high reliability and performance needs ha   is still high none of that changed they want  control of costs so one of the problems we've   seen is customers getting unexpected bills in  in the days that they are today and they also   want a mix of public and private type services  so that in most cases there's no concept that   you're all public or you're all private you  wanna they want a little bit of a mix of the   two and security is very important to them so  there's some I've got six kind of big bucket   items about requirements here and the big question  is why does public Cloud not satisfy on its own   and what are you what can you do with private  Cloud to meet some of these requirements   so my next slide I talk a little bit about some  of these needs in the public Cloud space what we   found is that you know it's continuing to be more  expensive than people thought and they're getting   unexpected bills which are ever increasing and  they find themselves essentially locked in once   you've made that effort to move something to the  public Cloud it's very difficult to bring it back   and as you make it some kind of hybrid people  do have security concerns obviously if they've   got their own private data you don't necessarily  want to put it in the public cloud and some would   say just won't mind public Cloud it's just  too much too big too difficult and there's   so the result is in general whatever happens  you have to maintain your own private clouds   going forward I think there's a lot of Industry  talk over the last number of years particularly   the analysts talking about everything moving  to the public cloud and obviously that never   made sense I think the private cloud data center  industry was continuing to increase quicker than   the public Cloud well but all the Press was about  public Cloud because obviously that was starting   from a very small base I think ultimately you'll  you'll always see both together but you have to   manually maintain your own private Cloud to keep  your business going up but I mean people still   have problems with the private Cloud especially  today and the climate today it's you know private   cloud is not cheap there's a lot of costs there  Power Cooling particularly with the new GPU type   loads that are ever increasing number of power  ever increasing need for cooling that equipment   and not just withstanding the equipment that you  need to purchase for these environments as well so   it's all very expensive very complicated because  there's lots of components from lots of vendors   you can put together and not everybody has the  Personnel to to do this either I mean a lot of the   times people have cut back on their resources so  it's very challenging and I think we think there's   a change in the the traditional monopolithic data  center for Enterprises so the hyperscale has got   these massive bigger and bigger data centers we  know that but for you as an Enterprise the type   of data centers you have today may be different  your traditional data center they used 10 years   ago may not have the power and cooling for modern  equipment making putting it all in one place may   not even work so a Colo space distributed data  center makes a little bit more sense so you may   have little pockets of data centers all connected  with high-speed connectivity to make it all work   and I and I think this This lends itself well to  the Druid solution because I want to be able to   then deploy easily a new little private cloud and  that's what I'll show you in this demo today when   I go through the product enough about requirements  what's Driven us here I think it's the the primary   thing message that I wanted to give there was  basically you know there's a continual need for   private clouds in private data centers they may be  more distributed so you need small little pockets   of connectivity and Drew actually has software  that I think we can help you help you with this   kind of situation so the bottom line number  one thing is that we've got software that we   can build for you we can manage it for you but we  want to leave you in control so you're using a lot   of open source gives the control to you because  that open source you continue to take forward   so what we've done at Drut is because we've you  know we've built why we built this software stack   in the first place is an interesting thing to  consider with the Drut hardware we had to build   enough software to be able to use our Hardware  right the base level it's an infrastructure type   software that we built anyway and what we found  is just that infrastructure software wasn't   enough to validate our use case so we built more  software on top of it more layers of software to   prove an application works in a droot hardware  environment so we built the software anyway for   our own purposes we showed that to customers  customers purchase the software and we realize   we need to productize this solution because  there's a lot of value to what we're providing   so I'll talk about the different layers of stacks  of software that we have in the solution my charts   here I'm trying to describe in the middle there  you see Drut software platform that's really   the underpinning software stack that we can use  to bring all this together and then below that   fabric manager which is essentially optional and  then on top of its services I'll talk about why   that's there in a moment start off with the Drut  software platform we basically have a hook into   work or infrastructure and Hardware so if you've  got data center machines you need to manipulate   them at a hardware infrastructure level bare metal  type level to be able to manage these machines   give them addresses Etc so it's a bare metal type  structure you need software to do that the DSP can   do that for you so you can purchase this solution  in these different layers so you could just use   the DSP to do two basic infrastructure of your  compute environment with lends yourself in the   you should be able to add group fabric as well but  you can continue on just using the root software   the next layer we optionally put on is a that  platform layer hypervisor containers virtual   storage so we've got the Drut software platform  experience where we can simply deploy this   containerized or virtual machine or virtual  storage solution together which is what I'm going   to show as part of this demo in fact I'm showing  one and two essentially today the top bottom two   layers now the next layers Cloud lock experience  have some workloads become very customizable per   install that's where Services come into it I think  everybody every customer dreams of a cloud-like   experience their services what they don't really  realize up front it's very difficult to do that   for a vendor in a generic fashion it needs to be  customized to you so you need services to do that   to really pin it all together we can show examples  and base structures but you need something   customized same for apps and workloads which is  why we have that layer built on top of it all   so the components we're typically deploying in  the DSP so there's a prime layer infrastructure   layer to DSP which is typically fixed towards  providing pixie Services image Services DHCP DNS   networking power management for devices and also  moves into the desegregated GPU type environment   but then we have options on top of it if you want  to do Cloud platform software on top with us then   we can take you down this journey of doing the  open source kubernetes openstack type environment   you can choose different ones  and you can do it yourself   so I mean we can give you a base platform that's  basically the hardware uh deployed and networking   figured out you can then start deploying your own  software on it or you can let us do the next level   which is the openstack and kubernetes piece and  then when you're doing something like openstack   you've got to have a storage solution that kind  of go together you don't really have one without   the other at this point so Ceph is the storage  solution of choice for us although we can be   flexible the model we've built allows plug-in  modules to be able to be inserted and you can   do different Technologies depending on what  you want to do on the left hand side you see   networking because we have a certain amount of  networking that will build into the solution a   basic VLAN configuration subnet configuration and  I'll show you the little bit of this in the demo   the right hand side the cloud experience is  obviously you as I said the services would tend   to do that and then workload application services  so my goal today in this demo it's primarily an   installation piece I want to show you how we  built a Playbook solution to be able to deploy   the different applications once they're up I do  a little troubleshooting session I wanted them to   show how to get it up and running smoothly and as  I made some let's call them intentional mistakes   in my deployment to show how to troubleshoot  the deployment mistake we're going to cover more   details on these applications in future sessions  so I'm going to do a deep dive on Ceph do a deep   dive on openstack we can do the cloud experience  portal and the other pieces I think as we go on   so zooming into the demo wise and I what I've done  this time is because I'm doing deployment it takes   more than a couple of minutes sometimes to do  deployment of the software and we don't want to   all sit here where I did it live so I did it in  advance and videoed it and I can play my little   videos but I want to start off with talking  about you know what happens in a data center   type deployment typically most data centers are  already there they've already got this function   called management control which is the DHCP PXE  boot image Services already in the data center   they already got that solution this solution here  comes with its own management control solution so   how does that work well we want to be able to  mix the two together nicely so they can play   well together and still be able to offer these  different Services now if you listen remember what   I said earlier on in the presentation a lot of new  data center deployments are little remote colos   what's interesting the original colos because  I've done some of these you turn up the cage the   cage has a rack there's no server in there for  you to deploy your management Control software   on Twitter there is an existing function so you  have to basically sometimes bring the management   Control software and put it together so you almost  have like a check in an egg situation where I got   to deploy my management Control software but I  need something to deploy it but there's nothing   there to deploy it with in this case you can use  a USB stick pre-install it on a machine or in some   cases we use a laptop to deploy the first machine  the da the demo here actually I did a video of me   deploying the basic machine just gave me a Linux  machine on a bare metal that's all I did so I I'm   not going to even play that video because it  just imagine I start this whole thing I've got   one Linux machine on bare metal and I'm going  to use that to bootstrap this opens environment   first picture I'll go on to is actually  showing it's a simplified view I've cut a   lot of it out to try and simplify it so I've got  my control management software which is my DSP   that's just a Linux machine at the moment and it's  connected to my management switch not worrying   about storage or data networks at the moment  because we're just trying to get it deployed   we can talk about all that in another session as  well I think that's probably worthy of an hour   just talking about the networking pieces to plug  all this together so here I've got a Linux machine   and I want to deploy the DSP so I'm going to go  into my video or before I do my video I want to   talk about the broadcast domain so you'll see  I've got a the way we set this one up is I've   got my yellow broadcast domain and my kind of  Bluey green broadcast domain I got two separate   domains because I imagine I'm plugging this  into an existing data center they've already   got management control functions so I basically  have that on a separate broadcast domain to my   new function because I want all my new machines  to boot up this dsb so when they come up they'll   go pixie boot anybody out there and they'll hit  my DSP they won't hit the Legacy set up now in my   lab I have hundreds of these management controlled  devices they're all over the place which is why we   we separate it like this you don't necessarily  drop hundreds in your places let's go to the   video and we can see what he's doing and I and I  speed some of it up so basically I've gone in I'm   actually running an ansible Playbook that it's  the configuration and manipulation installation   of these machines so it's basically it's I'm on  this Linux machine I've deployed I've downloaded   a file that Drut gave me and I run this Playbook  throughout the rest of the session today I don't   need to manually download any more files it's  going to grab everything for me so it makes it   really easy I just grab one file put it on this  Linux machine and then run it and it's now doing   the management control installation and setting  me up for the rest of it I'm gonna be able to set   an openstack from this system if you've ever done  sap and openstack you're trying to grab packages   from all over the place installing different  things figuring out the versions there's a lot of   complexity behind it I I like the way they've done  the Open Source Products and the so many modules   you see this install is finished which is great I  like the way they've done modularized it but there   can be a lot of different components because not  everybody needs the same things so we've curated   this and put it all together so this is me logging  into our DSP product for the first time now   so I logged in as droot and it's saying DHCP is  not enabled welcome to the solution I'm wanting   to answer a bunch of questions in the the basic  splash screen and then I can go into some of   the networking setups and do things here your  noted says a bunch of networks already set up   so what it does is actually read from the Linux  machine so the vlans and networks that were set   on the machine it pulled them in and basically  configured it I think there this solution that   we have the Playbook solution is its first  version we're going to continue to improve   it one thing I did is I went on this machine and  created these networks and then plumbed it and   then it read they read them in here automatically  what I'd like is to be able to go in here and add   it and it push you to the machine make it go the  other way around which is definitely within the   bounds of possibilities on this and there's a  bunch of other little things that you do but   I've basically set up my uh my system that can  do the deployments now so I can move to the next   setup so in this world I've got to do openstack  and I'm going to do Seth I want to set first get   the storage layer built then I'll do openstack and  then it'll integrate the two together running from   these playbooks and I don't install any more  software it's going to do it all for me grab   whatever it needs got all the packages there  anything it updates it will grab automatically   so this setup now if you see I've created I've  got these three new machines My Demo DSP five six   and seven they're connected to three of the the  yellow ports so the graphic is a little bit funny   but there trust me they are connected to those  yellow ports so that those machines when they   turn on can be detected by the DSP so they'll  beat they'll Boot and they'll do a PXE boot   and it'll detect them now if you've got a server  that's not set for pixie boot you need to set it   it's a little pixie boot out of this port changing  the BIOS story but once you set it so it picks a   good out of the report it'll be captured and it'll  be controlled in this environment you didn't even   install an OS on these machines the pure bear mode  you got straight from the factory let's watch the   install left hand side you'll see actually I've  got here's my machines on the right hand side   I'm running my CEPH Playbook so the set Playbook is  going to basically start grabbing the machines now   in this case I told CEPH which machines to grab  five six and seven I told it I said well maybe   I want for the cloud like experience I don't want  to tell you which machines to get this layer we're   showing you is layer below cloud like experience  Cloud experience as a user says just give me the   machines I Don't Care What machines and that that  layer would program this layer to do the actual   installation so here I say deterministically do  these machines so it's grab these three machines   you've seen on the left powered them on once they  come on it's going to install the ceph software   and then it's once it's installed the software  and got the packages right it's going to do   the configuration and configure the ceph cluster  together it's right now it's waiting see it's it's   going a little bit black screen and I sped some  of this up this is not not doing this in real time   because really been waiting a long time but it's  waiting for these machines to come up uh to be   active and then it'll start doing the next session  so the Playbook is pretty smart it knows what's   out there connect to things it can wait based on  looking for a status if it if there's a problem   or an error it gives sensible errors I've had  situations where I told it the wrong machine like   I said dsp-8 and there isnt an eight it just came  back saying you don't have an eight um and then   you have to go and fix it and you can just rerun  the The Playbook so it's found the machine they're   all in the right State now so they're all in the  right now so now it's installing the software   so all I did to get to this point is I  answered some questions basically to prompt it because I had to tell at the machines  and a couple more questions were answered so   from the user perspective it's pretty simple  answer a few questions run the script and it   does the deployment for you really removing a  lot of the headache for getting these kind of   services installed uh the way they were before  so step is there and up and running and I know   we've already had this question in in one  of the in the Q and A's that we can see so   I think some clever person said well how do you  know it's deployed right well and actually in   this case I didn't deploy it right I made a  mistake you do need to go in and have a look   and run through validation steps to make sure it's  correct um we can do we've done a certain amount   of that automatically but I've done something  that only ceph could tell you so actually I   didn't mean to do that I go actually to click  on the next slide still didn't mean to do that   idea there we go I found it I go to the next slide  is it working so let's run this one because this   will take us through an interesting little ride  about whether this is working this is me logging   in for the first time and I'll prove it because  it makes me change my password uh when I log in   I'm going to try and figure out you know what's  the health status and how can we go about getting   the help the status of the system uh to be  uh healthy and everything working correctly   so I just wonder I'm thinking out loud is that our  installation script can do a series of automatic   checks at the end of it and I think that would  probably be a good thing just wait there and   then come it comes back and tells you but in this  case today I logged into the dashboard you see the   dashboard's giving me a warning if you see the  charts in the bottle it's talking about the PG   status it's kind of inside baseball stuff a little  bit about Ceph um basically you want to clean up   the the placement groups these are actually it  says placement group positioning and too few   osds so the osds are the disks are on the setup I  literally gave this set up two disks, I got   three servers I gave it two disks and the minimum  it needs is three disks normally you'd have dozens   of disks you can have lots of service lots of  disks not so maybe I wanted to see if I could   break it a little bit and it's also an interesting  exercise to see about fixing it now when I did   this when I did this over I I had a bunch of you  see it's only used two systems not three as well   so I've got three servers I only use two for disks  so I went about actually the first thing I do is   add more disks so I went to my service and then  this screen basically it's showing us OSD one   six and seven is OST one this is the disk thing  we're going to make that for two later because   I'm going to go to the system add more disks and  this one is complaining about the PG status it's   undersized pgs placement groups which is where  it puts all the data across this F data store so in in general I don't care about that stuff so  now here I've gone to my machine I've added some   disk and you see it's got a third disk there  actually if I pause it a second you see this   this third disk has been added in already state  so obviously adding disks is very dependent on   the uh the hardware you have whether  you can hot insert them or what   these doubt particular Dells I was using uh hot  insert them but I but I had to uh this machine I   have to reboot it and create a virtual Virtual  Drive on the system and so having done that and   rebooted it jumped forward a little bit it's  basically uh added the new OSDs automatically   the problem was that actually I I still had a  replication factor is too high said replication   factor is three I've only got two machines that  have got OSD so I actually changed it all to two   so that was my next step so my I didn't have  enough thought placement groups or OSDs and I   had the replication factor too much now you  see I cleaned it up and it's all gone it's all   gone clean so replication Factor two two copies  of everything PG status is now all clean go back   to the dashboard and everything's good so that  was a quick method that I went through to to try   and troubleshoot it I think what I want to do for  surface I'll do a deep dive on this where we go   through this in intricate detail because there's  a lot of interesting use cases you can go through   with uh with your data and volumes and machines  and the high availability of the whole environment   etc etc that is very interesting I just wanted  to give you a feel so you think about from that   that install where I did the the video  where I did the install for the Playbook   that got me here all I did was answer a couple of  questions and it took me through the install when   I got to the next step so it's very easy to get  into that um position from the software and I only   downloaded the one package from Drut to Kickstart  the whole deployment and I know I was slightly   less on my discs but I got that installed up and  running now the next step I want to do openstack   so if you notice I've got three more machines in  my picture three open stack machines now so I'll   do the Playbook deployment of openstack and then  it's going to do an integration with Ceph bring   the two together at the end of that I'll be able  to create a VM in openstack using the data from a   volume from the ceph setup so I've basically got  a fully functioning setup at that point and all   I'm going to do is run another Playbook so here  I start off by actually looking at these machines   so these machines so I looked at the machines  I'm going to do for openstack I needed to tweak   some networking for them because I want to do  a little bit of network change to give the VMS   access to the internet because we've sort of got  external facing VMS now I'm doing them in the DSP   UI this is before machines are even turned on so  I'm kind of prepping it which is a nice function   I I then pointed my questions that I did on  the for the Playbook I then configured locally   um and the two are synced up together I think  an enhancer here is I can make one do the other   so if I do this it should push that answer that  question on the system for the installation and   vice versa that's kind of the direction I'd want  it to go now I run my setup openstack uh Playbook   and it'll go off and do the same thing as as it  did before similar thing it's going to grab the   machines on the left powers them on deploys Ubuntu  in this case that's deploying the OS from scratch   all I did was give it some bare motor machines  deploy Ubuntu and wait for it to come back it's   it's happy with the machines come back I was  really quick on the fast forwarding this time   and then it's doing the next Playbook so  it's basically got a series of playbooks here   and the next Playbook is basically installing all  of the openstack components and there's a with   open set there's a lot of different components  you can choose continue to add and change them   as you go forward so there's a lot more pieces  that it adds to the configuration once it's done   that configuration then it completes it by doing  an integration so making the volume from Ceph   available Within openstack so that as a user I can  create a machine and from that machine I want to   give it storage it's going to use storage from the  shared Self Storage environment I'm really making   it simple for the openstack operator and giving  you the basis providing that cloud-like experience   to the user so they can do so they can create the  machines and starting sorting applications as the   one they want so you see there's a lot of tasks  that's running through in this particular setup to   get all this configured and put together so really  as a huge Time Saver for an operator to be able to   do this now we have additional playbooks to do  more functions so the next thing you say what if   I want to add another server tomorrow I actually  got a playbook for adding a host and I think I'll   show that again in a another session because those  this first time installs are good this is a good   basis to show you can get up and running sometimes  you can get away with more complicated installs   sometimes they'll give you a week to do the  install if you're doing that in a Data Center and   it's finished it's set up now sometimes I'll give  you a week to do the install so you've got time to   get it up and running once you're up and running  in and in production or close to production those   luxuries are gone you don't have a week to do  anything now now you've got short windows so   those are the operations that are actually more  interesting than the base installed but I wanted   to do this one first because I want to show you  how how easy it is to get them running with the   DSP now I've got this software base that I can  start building on top of and doing a lot more with if I click the screen like that always takes  me to back to the the back to the video again   so in summary I think what we want to describe  off for for droot is uh we're really more than   just a hardware component company we've got a  whole software solution private Cloud stagger   software which is all optional pieces that you  can build in to offering cloud services for your   environment depending on your needs we probably  got the software and we definitely have a lot   of the hardware for you to be able to build a  modern private Cloud that sets you up for the   future and in a lot of cases uping open source  standard type software that really puts you   in control and not so vendor locked into other  Solutions well that's all I had time for today   um thanks very much for joining um I will  see you again in a future session goodbye

2023-09-25

Show video