Skip to content

School Of SRE Conclusion

Home
Level 101
Level 101
- Fundamentals Series
  Fundamentals Series
  - Linux Basics
    Linux Basics
    
    Introduction
    
    Command Line Basics
    
    Server Administration
    
    Conclusion
  - Git
    Git
    
    Git Basics
    
    Working With Branches
    
    Github and Hooks
    
    Conclusion
  - Linux Networking
    Linux Networking
    
    Introduction
    
    DNS
    
    UDP
    
    HTTP
    
    TCP
    
    Routing
    
    Conclusion
- Python and Web
  Python and Web
- Data
  Data
  - Relational Databases
    Relational Databases
    
    Introduction
    
    Key Concepts
    
    MySQL
    
    InnoDB
    
    Backup and Recovery
    
    MySQL Replication
    
    Operational Concepts
    
    Select Query
    
    Query Performance
    
    Lab
    
    Conclusion
  - NoSQL
    NoSQL
    
    Introduction
    
    Key Concepts
    
    Conclusion
  - Message Queue
    Message Queue
    
    Introduction
    
    Key Concepts
    
    Conclusion
  - Big Data
    Big Data
    
    Introduction
    
    Evolution and Architecture of Hadoop
    
    Conclusion
- Systems Design
  Systems Design
- Metrics and Monitoring
  Metrics and Monitoring
- Security
  Security
Level 102
Level 102
- Linux Intermediate
  Linux Intermediate
- Linux Advanced
  Linux Advanced
  - Containerization And Orchestration
    Containerization And Orchestration
    
    Introduction
    
    Introduction To Containers
    
    Containerization With Docker
    
    Orchestration With Kubernetes
    
    Conclusion
  - System Calls and Signals
    System Calls and Signals
    
    Introduction
    
    Signals
    
    System Calls
    
    Conclusion
- Networking
  Networking
- System Design
  System Design
- System Troubleshooting and Performance Improvements
  System Troubleshooting and Performance Improvements
  - Introduction
  - Troubleshooting
  - Important Tools
  - Performance Improvements
  - Troubleshooting Example
  - Conclusion Conclusion
    Table of contents
    
    Further readings
- Continuous Integration and Continuous Delivery
  Continuous Integration and Continuous Delivery
Contribute
Code of Conduct
SRE Community

Conclusion

Complex systems have many factors which can go wrong. It can be a bad design & architecture, poorly managed code, poor policies around different caches, bad DB queries or architecture, improper use of resources, or bad OS version, poorly monitored system, datacenter issues, network faults, and many more, Any of these can go wrong.

As an SRE, Knowing important tools/commands, best practices, profiling, benchmarking and scaling can help you with faster troubleshooting and performance improvement of the overall system.

Further readings

Here are some links from the LinkedIn Engineering Blog, as written by LinkedIn engineers, about firefighting they did, ensuring site up 24x7x365.