about summary refs log tree commit diff
path: root/content/resume.md
blob: 7797d54346607f19913f2d8651ebf0b8b88a9444 (plain) (blame)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
---
title: Resume
template: orphan.html
---

# Franck Cuny

Technical Director Site Reliability Engineer

Email: franck@fcuny.net | Phone: 415-617-5129

Results-driven Site Reliability Engineering leader with extensive experience in architecting, scaling, and optimizing large-scale distributed systems. Proven track record of driving reliability improvements, fostering cross-functional collaboration, and mentoring engineering talent. Dedicated to building resilient infrastructures and cultivating a strong reliability culture.

## Core Competencies:

- Technical leadership and mentorship
- Cross-team collaboration and communication
- Large-scale distributed systems architecture
- Reliability engineering and disaster recovery
- Infrastructure optimization and cost reduction
- Production readiness and failure testing methodologies

## Career Focus:

Seeking opportunities to lead transformative reliability initiatives, mentor the next generation of SREs, and drive architectural decisions that significantly enhance system resilience and performance at scale.

# Experience

## Roblox, San Mateo

<table border="2" cellspacing="0" cellpadding="6" rules="groups" frame="hsides">

<colgroup>
<col  class="org-left" />

<col  class="org-left" />

<col  class="org-left" />

<col  class="org-left" />
</colgroup>
<tbody>
<tr>
<td class="org-left">Site Reliability Engineer</td>
<td class="org-left">Technical Directory (IC7)</td>
<td class="org-left">SRE Group</td>
<td class="org-left">August 2024 - to date</td>
</tr>
<tr>
<td class="org-left">Site Reliability Engineer</td>
<td class="org-left">Principal II (IC6)</td>
<td class="org-left">SRE Group</td>
<td class="org-left">Feb 2022 - August 2024</td>
</tr>
</tbody>
</table>

As a Team Lead for the Site Reliability group, I define road-maps, milestones, and identify areas where SREs can partner with different teams to improve overall reliability of our infrastructure and services. Key projects and responsibilities include:

- **Cell Architecture Implementation**: Led the SRE effort to transition from monolithic Compute clusters to a Cell architecture, significantly enhancing Roblox's infrastructure resilience and efficiency. Developed migration plans, identified necessary automation, and drove production readiness for this critical reliability improvement.

- **Edge Infrastructure Migration**: Spearheaded the migration from HAproxy to Envoy at the edge, aimed at reducing failure domains, improving performance by streamlining the proxy chain, and enabling user traffic steering to specific cells from the edge.

- **Active/Passive Reliability Lead**: Orchestrated the failover strategy across multiple teams, developing detailed action plans and validation procedures. Conducted comprehensive tests to ensure plan effectiveness. This work reduced the amount of time for a fail-over from days to hours.

- **Reliability Culture Champion**: Mentored engineers of various levels (both SREs and SWEs), established a model for production readiness, and popularized the practice of running failure exercises for new large infrastructure projects.

- **Technical Leadership**: Acted as tech lead on numerous projects, demonstrating strong cross-team collaboration skills. Provided technical guidance and mentorship to the SRE team, fostering a culture of reliability and continuous improvement.

Key strengths include driving complex infrastructure projects, mentoring, setting reliability standards, and facilitating effective cross-team collaboration.

## Twitter, San Francisco

<table border="2" cellspacing="0" cellpadding="6" rules="groups" frame="hsides">
<colgroup>
<col  class="org-left" />
<col  class="org-left" />
<col  class="org-left" />
<col  class="org-left" />
</colgroup>
<tbody>
<tr>
<td class="org-left">Site Reliability Engineer</td>
<td class="org-left">Senior Staff</td>
<td class="org-left">Compute SRE</td>
<td class="org-left">Jan 2018 - Jan 2022</td>
</tr>
<tr>
<td class="org-left">Site Reliability Engineer</td>
<td class="org-left">Staff</td>
<td class="org-left">Storage SRE</td>
<td class="org-left">Aug 2014 - Jan 2018</td>
</tr>
</tbody>
</table>

### Key Achievements and Responsibilities:

- **Large-Scale Infrastructure Management**: Led SRE efforts for one of the world's largest compute clusters (Mesos), spanning hundred of thousands of nodes across multiple data centers. Defined KPIs and improved automation for managing a massive fleet of bare metal machines.

- **Kubernetes Adoption**: Spearheaded the initiative to adopt Kubernetes for on-premise infrastructure, driving architectural decisions and implementation strategies.

- **Cost Optimization**: Designed and implemented strategies that significantly improved hardware utilization, resulting in tens of millions of dollars in savings on hardware costs.

- **Tech Leadership**: Served as Tech Lead for a team of 6 SREs supporting Compute infrastructure. Established critical team processes including on-call rotations and postmortem procedures.

- **Cloud and On-Premise Expertise**: Led multiple efforts related to Kubernetes deployment and management, both in cloud environments and on-premise infrastructure.

- **Storage Systems Migration**: Successfully migrated all pub-sub systems from bare-metal deployment to Aurora/Mesos, pioneering the adoption of the Compute orchestration platform among storage teams. This transition reduced operational overhead, decreased deployment times, and enhanced overall system reliability.

- **Network Infrastructure Improvement**: Advocated for and implemented the adoption of 10Gb+ networking in data centers, enabling significant scaling improvements for storage systems.

- **Cross-Functional Leadership**: Served as the SRE Tech Lead for the real time storage team, driving improvements in performance, operations, and automation across storage systems.

I consistently demonstrated the ability to lead complex technical initiatives, deliver impactful projects on-time, optimize large-scale systems, and drive cross-functional collaboration to achieve significant improvements in infrastructure reliability, efficiency, and cost-effectiveness.

## Say Media, San Francisco

<table border="2" cellspacing="0" cellpadding="6" rules="groups" frame="hsides">

<colgroup>
<col  class="org-left" />

<col  class="org-left" />

<col  class="org-left" />

<col  class="org-left" />
</colgroup>
<tbody>
<tr>
<td class="org-left">Software Engineer</td>
<td class="org-left">Senior SWE</td>
<td class="org-left">Infrastructure</td>
<td class="org-left">Aug 2011 - Aug 2014</td>
</tr>
</tbody>
</table>

During my time at Say Media, I worked on two different teams. I started as a software engineer in the platform team building APIs then I then transitioned to the operation team to develop tooling in order to increase the effectiveness of the engineering organization.

## Linkfluence, Paris

<table border="2" cellspacing="0" cellpadding="6" rules="groups" frame="hsides">

<colgroup>
<col  class="org-left" />

<col  class="org-left" />

<col  class="org-left" />

<col  class="org-left" />
</colgroup>
<tbody>
<tr>
<td class="org-left">Software Engineer</td>
<td class="org-left">Senior SWE</td>
<td class="org-left">Infrastructure</td>
<td class="org-left">July 2007 - July 2011</td>
</tr>
</tbody>
</table>

I was one of the early engineers joining Linkfluence in 2007. I led the development of the company's crawler (web, feeds). I was responsible for defining the early architecture of the company, and designed the internal platforms (Service Oriented Architecture).
I contributed to multiple open source projects on behalf of the company and represented the company at numerous open source conferences in Europe.