Assistant Technical Manager, IT Disaster Recovery
Who are we?
We are the IT Division of HKJC, a vibrant community of over 1,500 dedicated professionals working collaboratively across Hong Kong and Shenzhen.
Our team is a diverse mix of individuals from various backgrounds, from all across the world. We embrace our humanity, recognizing that each of us brings unique strengths and perspectives. This diversity not only enriches our work environment but also drives our innovation and creativity as we strive to achieve our collective goals.
What do we do?
We design, build, and operate the technology that powers the Club. Our primary focus is on delivering the service that supports our hospitality, racing and wagering operations, to ensure that our customers and members enjoy exceptional experiences.
We also deliver the changes necessary to drive business growth through new products and services. And, we are committed to safeguarding the Club by protecting it from external threats, providing a secure and resilient technological environment.
The Department
The IT Infrastructure and Platform Operations Department is responsible for the design, implementation, and management of the infrastructure that supports the Club’s IT systems, and leads the Service Management capabilities that ensure the smooth running of these systems.
This department ensures that all technological resources operate efficiently and effectively to support business objectives. Key responsibilities include:
- Design and operate processes and controls that ensure IT service availability, performance, and resilience are aligned with business expectations.
- Manage the 24x7 IT Operations Centre.
- Manage the Club’s exploitation of the public cloud.
- Manage the complete lifecycle of the Club’s IT network and the technology within our data centres.
- Provide the roadmaps, standards, and capabilities that enable our IT infrastructure to remain current (eligible for vendor support) and secure (patched and remediated against CVEs).
- Provide the Club’s colleague collaboration technology suite, including desktop and laptop computers, mobile devices, collaboration tools, carrier contracts, and associated support functions.
The Job
You will:
- Support Development and Implementation of Disaster Recovery Plans: Assist in maintaining and updating disaster recovery (DR) plans, ensuring documentation of recovery objectives (RTOs, RPOs), backup protocols, and restoration procedures is accurate and current. Help prepare for and participate in DR plan testing and validation exercises, contributing to the documentation of lessons learned. Support the administration of required resources and tools needed for DR activities under senior management guidance
- Assist in Risk Assessments and Impact Analysis: Help collect and analyse data to identify vulnerabilities across IT applications and infrastructure. Participate in evaluating potential disaster scenarios and their impact on critical systems. Collaborate with internal teams and business units to support Business Impact Analyses (BIAs), ensuring understanding of system interdependencies and recovery priorities
- Coordinate with Internal Teams and Support Communications: Work closely with cross-functional IT, operations, and business teams to facilitate alignment of DR plans with business goals. Assist in integrating Non-Functional Requirements (NFRs) and operational criteria into technical documentation as directed. Support Site Reliability Engineering (SRE) initiatives by contributing technical insights and operational feedback
- Contribute to Monitoring and Testing of Recovery Procedures:Help organise and conduct DR simulations, drills, and table-top exercises. Document test outcomes and assist in updating recovery plans based on findings and changing the IT landscape. Track compliance with DR readiness standards and contribute to continuous improvement efforts
- Support Training and Awareness Programs: Assist in delivering employee training sessions, coaching team members on DR procedures and recovery roles. Help coordinate drills and awareness campaigns to promote operational readiness and technical skill development
- Assist in Recovery Operations and Problem Management: Provide operational support during disaster recovery events by liaising with technical teams and ensuring timely communication of status updates. Participate in incident and problem management activities by helping identify recurring IT issues, collecting problem-related data, and documenting findings. Contribute to root cause analyses and track remediation actions to minimise incident recurrence and improve IT resilience
- Contribute to Team Effectiveness and Continuous Improvement: Support team performance by managing assigned tasks efficiently and assisting management in resource coordination. Help drive maturity improvements in DR and problem management processes according to established frameworks and best practices
About You
You should have:
- University degree qualification with a strong technical background, particularly in Information Technology, cybersecurity, application development and/or networking
- Over 4 years of IT experience managing complex production and testing environments within large organisations
- Experienced in developing and implementing IT Disaster Recovery (DR) plans, including setting Recovery Time Objectives (RTOs) and Recovery Point Objectives (RPOs)
- Proven track record delivering mission-critical systems with strong ownership and accountability
- Excellent interpersonal and communication skills, able to engage effectively with IT teams, business users, and executive management
- Proficient in English (spoken and written), Cantonese, and Putonghua; strong presentation and writing skills
- Good commercial awareness and understanding of contractual issues related to IT services
- Experience in vendor management and coordination with external partners
- Strong problem-solving, troubleshooting, and diagnostic skills to address complex IT issues quickly and effectively
- Solid understanding of ITIL and service management frameworks, covering Incident, Problem, Change, Asset, Configuration, and Service Level Management
- Expertise in various data backup and recovery technologies, including cloud-based, on-premises, and hybrid solutions
- Knowledgeable in Site Reliability Engineering (SRE) practices that emphasise automation, monitoring, and maintaining high availability of IT systems
- Strong understanding of IT infrastructure components, including servers, networks, storage, and databases
- Experience conducting disaster recovery tests, drills, and table-top exercises to evaluate and improve plan effectiveness
- Proven ability to manage DR projects end-to-end: planning, execution, progress monitoring, and stakeholder communication
- Familiarity with compliance requirements and industry standards relevant to disaster recovery and business continuity
Terms of Employment
The level of appointment will be commensurate with qualification and experience.
How to Apply
Please send your resume, complete with expected salary and job reference by clicking the Apply Now button or to:
Fax: 2966-5770
Mail: The Human Resources Department, The Hong Kong Jockey Club, 1 Sports Road, Happy Valley, Hong Kong
We are an equal opportunity employer. Personal data provided by job applicants will be used strictly in accordance with the Club's notice to employees and prospective employees relating to the Personal Data (Privacy) Ordinance. A copy of which will be provided immediately upon request.
Share this Job :
To share this job on WeChat, please click the button below to copy the link: