Data Platform Engineering

The Data Platform Engineering team maintains infrastructure and services that help data producers and consumers collect, discover, and use trustworthy data to derive insights, conduct research, and build new data products.

Documentation for users and administrators of Data Platform systems, including the Data Lake, analytics tools, Event Platform, Metrics Platform, and data pipelines.

List of teams in the Data Platform Engineering group, links to their documentation, and information about team processes, current projects, and roadmaps.

To contact us, please use the intake process.

About the Data Platform Engineering docs

Page categories

Category metadata is easier to change than page structure, and it provides another way to explore related pages in this domain. The major categories for these docs are listed below.

Category list


  • Analytics_cluster
  • Analytics_Query_Service
  • Dashiki
  • Data_domains
    • Contribution_data
      • Edits_data
      • Editors_data
    • Traffic_data
      • Pageviews
      • Unique_devices
    • Content_data
  • Data_pipelines
  • Data Platform
  • Data_Platform_systems
  • Data_stream
  • Dumps
  • Metrics
  • Query_engines
    • Spark
    • Hive
    • Presto
  • Query_examples
  • WDQS
  • Wikistats
  • Decision logs

Guidelines for maintaining these docs

TODO: Documentation maintenance guidelines

Common questions:
  • Where to put decision records, evaluations
  • Where to put project updates and product roadmaps
  • Where to put metrics documentation

Keys to sustainable maintenance:
  • Apply categories to pages.
  • Cross-link between wikis, DataHub, GitHub, etc.
  • Avoid deep page nesting (more than 3 levels is probably too deep).
  • Docs on Wikitech should generally document how to use or administer a technology or system, not the team that maintains it. (This pattern has not been followed consistently in the past, so the current state of the docs on Wikitech doesn't reflect it.)

This article is issued from Wikimedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.