Programming the Network Management Stack for Cloud Datacenters

dc.contributor.advisorChen, Angen_US
dc.creatorHsu, Kuo-Fengen_US
dc.date.accessioned2022-09-23T16:01:41Zen_US
dc.date.available2023-02-01T06:01:19Zen_US
dc.date.created2022-08en_US
dc.date.issued2022-06-07en_US
dc.date.submittedAugust 2022en_US
dc.date.updated2022-09-23T16:01:41Zen_US
dc.description.abstractDatacenter networks are the foundation of cloud services. The increasing demand of these services leads to challenging requirements to the network management, especially in security, performance, and reliability. However, managing the network to satisfy these requirements is difficult, as datacenter networks used to contain many opaque, vendor-specific components (e.g., proprietary switch hardware and software). The management stack only has limited control over these components, making optimizations hard to achieve. In recent years, datacenter networks have become more open and programmable. This trend gives rise to the possibility of programmatic control from the management stack, with a more precise and customizable control loop. In this thesis, we leverage this trend to systematically improve the network management stack, rethinking how security, performance, and reliability goals can be better addressed programmatically. First, to address the unique security issues due to the emerging Remote Direct Memory Access (RDMA) hardware, we developed Bedrock. By co-designing RDMA hardware with programmable switches, Bedrock significantly enhances RDMA security without sacrificing its native performance. Second, to achieve high performance, granular load balancing in datacenter networks is required for to operate at high utilization. We built Contra to implement performance-aware routing protocols in a distributed manner. By analyzing the network topology and the user-provided policies, Contra can automate the generation of routing protocols for programmable switches to enforce the policies at hardware speeds. Finally, high reliability is crucial to large networks, and it is again up to the management stack to achieve this goal. We proposed Occam, a system that exposes a shim layer of APIs for network management tasks. With the restricted but expressive APIs, our algorithms on scheduling and rollback plan generation can provide better transaction semantics for network management tasks and prevent conflicts between them. In all three cases, we show how the network management stack can be drastically improved with programmability.en_US
dc.embargo.terms2023-02-01en_US
dc.format.mimetypeapplication/pdfen_US
dc.identifier.citationHsu, Kuo-Feng. "Programming the Network Management Stack for Cloud Datacenters." (2022) Diss., Rice University. <a href="https://hdl.handle.net/1911/113223">https://hdl.handle.net/1911/113223</a>.en_US
dc.identifier.urihttps://hdl.handle.net/1911/113223en_US
dc.language.isoengen_US
dc.rightsCopyright is held by the author, unless otherwise indicated. Permission to reuse, publish, or reproduce the work beyond the bounds of fair use or other exemptions to copyright law must be obtained from the copyright holder.en_US
dc.subjectNetwork managementen_US
dc.subjectProgrammable networksen_US
dc.subjectDatacenter networksen_US
dc.titleProgramming the Network Management Stack for Cloud Datacentersen_US
dc.typeThesisen_US
dc.type.materialTexten_US
thesis.degree.departmentComputer Scienceen_US
thesis.degree.disciplineEngineeringen_US
thesis.degree.grantorRice Universityen_US
thesis.degree.levelDoctoralen_US
thesis.degree.nameDoctor of Philosophyen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
HSU-DOCUMENT-2022.pdf
Size:
3.98 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 2 of 2
No Thumbnail Available
Name:
PROQUEST_LICENSE.txt
Size:
5.84 KB
Format:
Plain Text
Description:
No Thumbnail Available
Name:
LICENSE.txt
Size:
2.61 KB
Format:
Plain Text
Description: