Base Operations is building the world’s largest dataset of global threat patterns and street-level intelligence. We are looking for an exceptional data engineer that can define, build, and mature the data pipelines, models, and warehouse to help us excel in this objective. This individual should have the technical acumen to contribute thought leadership relative to our architecture strategy, while also excelling at mapping that strategy into a tactical plan and executing against it.
RESPONSIBILITIES
- Build, test, and maintain robust ingestion and transformation pipelines, incl. the injection of NLP models to extract and augment data from free text sources.
- Develop data transformation, validation and analysis methods to augment the utility and actionability of data
- Mature data models with respect to consumption patterns and performance objectives.
- Operationalize data quality throughout, ensuring visibility and timely remediation of data quality issues.
- Contribute to GIS data architecture strategies, and drive transformation and implementation activities resulting from those strategies.
REQUIREMENTS
- 5+ years experience building production-level data pipelines and standing up the platforms to support them
- 5+ experience implementing and maintaining data warehouses, data models, and performing schema migrations. Strong working knowledge of SQL
- Hands on knowledge of GIS enabled data stores, e.g., PostGIS, Snowflake, and analytic data platforms, e.g., DataBricks
- Demonstrable thought leadership on operationalizing data quality
- Competent developer with experience delivering production ready code, esp. Python. Familiarity with GIS-related libraries such as GeoPandas, as well as with JavaScript desirable.
- Familiarity with AWS pipeline tools, incl. step functions
- Strong collaborator across functional areas, incl. data science, infrastructure, SW development, and product