~/devreads

#resiliency

6 posts

7 Aug 2024

Sam Shen 7 min read

At Clever we move a lot of data, both from school districts into Clever and from Clever into all the external places that school districts want that data shared. Broadly, we call this behavior “syncing”, and various sync flavors show up across Clever. Let’s talk about two ways to perform data syncs by using an […] The post Everything but…

data, resiliency, interoperability, syncs

10 Aug 2021

Nathan Leiby 5 min read

Why multi-region sessions? Each year leading up to Back to School (our busiest season), Clever’s engineering team invests in our highest-traffic systems to make sure we can handle user growth and new traffic patterns. During 2020–2021, SAML auth at Clever grew from <10% of our login-related traffic to about 40% of our traffic! For this […] The post…

devops, resiliency

16 Oct 2019

Michael Gao 8 min read

When building systems for new products, there’s a delicate balance between writing code that works and writing code that lasts. A common anti-pattern is preemptively optimizing systems for the future while still trying to find product-market fit. For new product teams, this can be a costly mistake as it leads to a slower iterative […] The post Evolving Systems…

general, resiliency

12 Oct 2018

Ben Adida 7 min read

Everyone in the US is now back in school, and we’ve been feeling the rush of excitement here at Clever over the last few weeks. On a typical day at Clever this school year, we regularly hit more than 1,000 logins per second! As we’ve chronicled over the last few months (July, May), the […] The post Clever SSO…

resiliency

8 Aug 2018

Ben Adida 10 min read

Over the last few months, we’ve been readying Clever SSO for our biggest year ever. We want students across the country logging in quickly and reliably to all of their learning applications. A couple of months ago, we described our initial stress testing approach. In this post, we want to tell you more about this […] The post Clever SSO…

resiliency

16 May 2018

Ben Adida 7 min read

Two months ago, we experienced the worst outage in the history of Clever SSO. We wrote up a postmortem soon afterwards. We mentioned at the time that this postmortem was the beginning of our process to reevaluate everything we do to make sure we can be worthy of the trust you place in us. We […] The post Clever Reliability…

resiliency