Entries categorized as ‘ESX 3.5’
Not so long ago I posted a link to a VMWare blog in which beacon probing was demystified. This article stated that you only should use beacon probing when there is no link state tracking on the physical switches and you could consider beacon probing as a nice software solution for replacing it.
Well we’ve got our ESX environment set up by a local supplier and they advised us to use beacon probing instead of link state tracking. But for some reason beginning from that moment I got major events from my ProCurve switches stating excessive broadcasts. This wasn’t often, but especially during peak hours I was getting this notification.
When I started sniffing the network packets on the uplink of the switches I noticed what kind of packets it were. It was an almost continuous flow of RARP packets coming from the ESX servers. RARP packets are meant as MAC address table updates for switches. This way when a node is suddenly available on a different MAC address the switch already knows the new path. This is also what happens when a virtual switch detects a link is not functional. It will switch the uplink and notify the switches. When beacon probing isn’t working as expected ESX constantly thinks the uplink isn’t functional so it is constantly switching the uplink and as a result constantly sending out RARP packets.
While you can define both the notify switches parameter and the network failure detection, it isn’t good to just put notify switches to ‘No’. This way the failover is still constantly changing uplinks and this can result in errors (in my case timeouts with TFTP). The real problem was the failure detection. Beacon probing for some reason just didn’t work in our environment. When I changed the failure detection to ‘Link Status only’ all the RARP packets disappeared and my excessive broadcasts were gone.
In my case just link status only is sufficient, but I can imagine there are cases where you would want to use beacon probing. If you enable beacon probing and this results in excessive broadcasts (or just more broadcasts) I do advise to look if you could find those RARP packets. This can indicate that beacon probing is just not working correctly in your environment.
I want to thank Scott Lowe for giving me a push in the right direction.
Categories: ESX 3.5 · HP · Networking · ProCurve · VMWare · vSwitch
Tagged: beacon probing, ESX, ESX 3.5, Networking, ProCurve, Switches, VMWare
As the new year begun some servers were due to be moved to another place more permanent burial place. Our ‘expensive’ SAN storage space is too small to keep the old servers on it so it was time to move them to a USB harddisk. But I had no idea how I could attach it to the ESX. Well for those of you who want to know for the future here is how I did it.
Well first of all I hooked the USB drive onto the ESX server (HP Proliant DL380g5). When I logged onto the console the first thing I did was execute cat /var/log/messages. This gave me following lines:
kernel: Initializing USB Mass Storage driver...
kernel: usb.c: registered new driver usb-storage
kernel: scsi3 : SCSI emulation for USB Mass Storage devices
kernel: Vendor: WD Model: 10EAVS External Rev: 1.05
kernel: Type: Direct-Access ANSI SCSI revision: 02
And a couple of lines further:
kernel: VMWARE: Unique Device attached as scsi disk sdc at scsi3, channel 0, id 0, lun 0
kernel: Attached scsi disk sdc at scsi3, channel 0, id 0, lun 0
kernel: SCSI device sdc: 1953525168 512-byte hdwr sectors (1000205 MB)
kernel: sdc: sdc1
kernel: USB Mass Storage support registered.
From this information I concluded that the USB drive was recognised and was emulated as SCSI, hence the sdc. My harddrive already contained a NTFS partition, but ESX doesn’t have the NTFS drivers. Thats why it will mount the partition using FAT32. This would pose a problem because FAT32 doesn’t support files bigger as 4Gb and my vmdk’s are much bigger. So I decided to format the disk with EXT2 so I could move the files.
I first deleted all the existing partitions with fdisk and after that I created a new primary partition with partition type Linux. After formatting it with the command mke2fs /dev/sdc1 I had a working Linux partition. If you by any chance would want to have an EXT3 disk you could use mke2fs -j /dev/sdc1.
So now I got a working disk and a working partition. First I needed a mount point before I could actually mount the USB drive. So I made a directory in /mnt called ‘usb’ with mkdir /mnt/usb and I executed the following command:
mount /dev/sdc1 /mnt/usb
Now I could access the disk at /mnt/usb and copy all the old VM’s from /vmfs/volumes/[datastorename]. If you want to copy entire directory’s make sure you use the '-R' parameter with cp.
Categories: ESX 3.5 · HP · Linux · VMWare
Tagged: back-up, ESX 3.5, ext2, ext3, fdisk, mke2fs, mount, USB
For some time I wanted to virtualize the only non-Windows server we had. It was a Red Hat Enterprise Linux 2 server serving some old legacy application. This server had to be preserved en kept running for archiving purposes. So instead of letting it run consuming power and hope it wouldn’t fail (+-7 year old hardware?) we wanted it to be virtual. This way we weren’t reliable on old hardware and we had a flexible way of turning it on whenever we needed it.
With great enthusiasm I went to the datacenter with the VMWare converter CD and I performed a quick and smooth virtualization of the machine. However when I started the machine I got kernel panics and notifications of partitions that couldn’t be found.
Well after quite some time (it wasn’t the highest priority) and a lot of searching I finally found a blogpost called Virtualize a Linux Server with VmWare Converter 3.0.1. “This is exactly what I am looking for!” was my first thought. Well although the article is really clear (well done Manlio Frizzi) it helped me only a bit further.
So my first steps were those that Manlio Frizzi described on his blog. Virtualized the system to my ESX environment. After i’ve done that I downloaded the rescue iso of Fedora Core 3 (FC2 wasn’t available) and mounted this in the virtual cd-rom drive. After selecting “Rescue installed system” from the bootscreen it started loading the BusLogic driver. Then I selected English as my language and US as my keyboard lay-out.
(more…)
Categories: Converter · ESX 3.5 · Linux · VMWare
Tagged: BusLogic, Converter, dmesg, eth0, fstab, grub, Linux, RHEL, VMWare
In part 1 I covered some NIC operations from command line. In part 2 I will cover some standard virtual switch tasks like adding and deleting a virtual switch and making and configuring PortGroups. So here goes for virtual switch operations…
Listing all virtual switches
esxcfg-vswitch -l
This commands gives you a list of all the configured virtual switches with their PortGroups and connected uplinks. Further more a lot of properties are shown about the vSwitches and PortGroups. For vSwitches it shows among other things the name, the uplinks, the number of used ports and the number of configured ports. For PortGroups it shows the PortGroup name, VLAN ID and uplinks.
Add a virtual switch called ‘TestSwitch1′
It’s really simple to add a virtual switch to an ESX server. You simply use the following command:
esxcfg-vswitch -a TestSwitch1
This creates a virtual switch with the name ‘TestSwitch1′. It still has no PortGroups and it has been set with the default amount of configured ports (64). To see all the properties use the command provided earlier to list the virtual switches. If you want to specify the number of configured ports you can use the following command:
esxcfg-vswitch -a TestSwitch1:16
This gives you a virtual switch named ‘TestSwitch1′ with 16 configured ports.
(more…)
Categories: ESX 3.5 · Networking · VMWare · vSwitch
Tagged: CLI, ESX, ESX 3.5, esxcfg-nics, esxcfg-vswitch, Networking, NIC, pNIC, VMWare, vSwitch
As I was looking around in the command line interface (which is pretty new for me) I came around the esxcfg- command set. In particular the commands to manage the NIC’s (part 1) and the vSwitches (part 2) raised my interest. I decided to explore a bit further and write down how to do some standard actions. So here goes for NIC operations…
Listing all NIC’s
esxcfg-nics -l
This commands gives you a nice list of all the available NIC’s and all their properties. Those properties include name, link, speed, duplex and description.
Setting a specific link speed and duplexity of a NIC
The thing I want to do here is set my ‘vmnic3′ (the name I got from my previous command) to a speed of 100Mbps and I want to set it to full duplex. The command to do this is:
esxcfg-nics -s 100 -d full vmnic3
The ‘-s’ parameter defines the speed. This parameter can hold the values 10, 100, 1000 and 10000 respectively defining the speed to 10Mbps, 100Mbps, 1000Mbps and 10000Mbps.
The ‘-d’ parameter defines the duplexity. This parameter can hold the value ‘full’ for full duplex and ‘half’ for half duplex.
Setting link speed and duplexity of a NIC to automatic detection
To set my ‘vmnic3′ back to automatic detection I use the following command:
esxcfg-nics -a vmnic3
The ‘-a’ parameter simply sets the link speed and duplexity of the NIC back to automatic.
I hope this was useful to someone. At least I got better understanding and a little reminder for myself how to do these things. In part 2 I will cover some standard networking tasks considering virtual switches using the command esxcfg-vswitch.
Categories: ESX 3.5 · Networking · VMWare · vSwitch
Tagged: CLI, ESX, esxcfg-nics, NIC, pNIC, vSwitch
… you are using those switches for routing. Great so the only option for improving my networking environment for load balancing my ESX environment was crushed by these two HP documents: Switch Meshing and LAN Aggregation Through Switch Meshing.
Apparently there is a whole list of requirements for switch meshing. This list is most clearly defined in Switch Meshing (starting from page to 5 and ending at page 7). Just a short beginning of the list:
-
A meshed switch can have some ports in the meshed domain and other ports outside the meshed domain. That is, ports within the meshed domain must be configured for meshing, while ports outside the meshed domain must not be configured for meshing.
-
Meshed links must be point-to-point switch links.
-
On any switch, all meshed ports belong to the same mesh domain.
-
A switch can have up to 24 meshed ports.
-
A mesh domain can include up to 12 switches.
-
Up to five inter-switch, meshed hops are allowed in the path connecting two nodes through a switch mesh domain. A path of six or more meshed hops between two nodes is unusable. However, in most mesh topologies, there would normally be a shorter path available, and paths of five hops or fewer through the same mesh will continue to operate.
-
….
I have four HP ProCurve 5300 series switches and they all do routing. Switch meshing was my only option left after exploring NIC teaming articles from Scott Lowe and Lukas Kubin. Those articles describe perfectly how you could do NIC teaming and VLAN trunking on one ProCurve switch, but not how to do it with two of them. After some searching the only option to accomplish what I wanted was a HP patented technique called switch meshing.
So this is not an article explaining how to configure switch meshing for use with ESX, but to warn people for all the requirements. If your environment complies with all the requirements made for switch meshing the two documents named above will do sufficiently as far as it goes how to configure and what switch meshing exactly is.
If I ever end up in an environment where I am able to configure switch meshing I will certainly post how I did it. During my search for what switch meshing was I found out about another technology called XRRP. With that technology at least I can make a really good failover for my gateways, but later more on that.
Categories: ESX 3.5 · HP · Networking · ProCurve · VMWare
Tagged: ESX, HP, ProCurve, Switches
Not so long ago (14th of December) Mohammed Fawzi posted a blogpost called Hyper-V vs VMWare. In this post he wanted to put together a comparison of Microsoft’s Hyper-V and VMWare’s ESX (Enterprise). Although I don’t think he accomplished what he wanted to accomplish with that post (comparing the two hypervisors), he managed to accomplish something. And that something is what is not in the article itself, but in the comments beneath it. It started a really firm, mostly technical and sometimes even emotional, response of the community that he was comparing apples with grapes and doing it all wrong.
The replies didn’t even stop at Mohammed Fawzi’s blogpost, but even resulted in new articles stating the faults he made when he compared the two. The writers (Scott Lowe and Jason Boche) of those articles are writers whose blogposts I read with great care and are (in my opinion) really interesting.
This rumble in hypervisor land remembered me at the console wars where fanboys were flaming that their console is the best and the rest of the gaming consoles were rubbish. Although the comments and articles really don’t represent the behaviour of those fanboys flaming about their favourite console, I recognised some of the emotions that appeared and how heavily the community reacted on it.
I myself work a lot with HP equipment and use VMWare ESX as my hypervisor and I think it was the best solution in my situation. Just as with gaming consoles I believe the best hypervisor is the one that fits best in your situation. And above all I like the idea of more than one choice in hypervisor land. It will keep everyone awake and new functions will be developed to have that one unique selling point.
What the community doesn’t need is a break because there isn’t a trustworthy datasource. What the community does need is a factual datasheet comparing the two so that when we have to make that choice or when we have to write that business case we all have the facts. I like to know what Microsoft is doing and how it solves problems with their solution as well as how VMWare does those things. I hope I can learn from that and one day I can use that to my advantage and solve a problem or make the correct decision using all my knowledge.
The only question that remains is: Who is going to make that factual datasheet that is trustworthy?
Categories: ESX 3.5 · HP · Hyper-V · Microsoft · VMWare
Tagged: ESX, Hyper-V, Microsoft, VMWare
December 29, 2008 · 1 Comment
I was browsing through the VMWare blogs and I ran into the VMWare networking blog. They posted a really nice article about when to use beacon probing. Although it was already posted on the 10th of december I thought it was worth mentioning.
Beaconing is one of those features that often confuses even the most experienced networking admin.
Shudong Zhou, one of our senior engineers, recently posted an entry on the internal blog explaining how it works and how you might use it. He gave me permission to cut and paste his entry. Here it is …
Read the rest of the article here at the VMWare Networking Blog
Categories: ESX 3.5 · Networking · VMWare · vSwitch
Tagged: ESX 3.5, Networking, NIC teaming, VMWare, vSwitch
Today a power outage hit our datacenter for several hours. This was too long for the UPS so everything was shut down since we don’t have any backup generator or something like that. However when power came back on the ESX hosts booted faster as our storage controllers. I logged into VirtualCenter and I saw that none of my virtual machines were powered on automatically. They were greyed out with “(inaccessible)” behind the names. I googled, but couldn’t find anything related to my problem.
When I looked at the ESX hosts they weren’t showing my SAN datastores. So I went to my commandview EVA server and looked if the LUN’s were there. In fact first they weren’t there, but after a while they finally appeared. I turned back to my VirtualCenter and looked if the datastores also would appear automatically again, but that apparently was too much to ask. Not knowing what to do I started looking around what I could do. I looked at the storage adapter configuration of one of the ESX hosts and the HBA’s were nicely listed, but still no LUN’s. Finally just before I wanted to reboot the entire host I discovered I could rescan the HBA with right clicking on the HBA and then click “Rescan”. After the rescan was finished my datastores returned and I was back in business.
For the people who like to do everything at the commandline you can use the command:
esxcfg-rescan vmhba[0...n]
Categories: ESX 3.5 · HP · SAN / EVA · VMWare · VirtualCenter
Tagged: commandview, ESX, EVA, SAN / EVA, UPS, VirtualCenter, VMWare