GitHub says it will use Copilot interaction data, including inputs, outputs, and code snippets, to train its AI models starting April 24, unless users opt out (Corbin Davenport/How-To Geek)
Why it matters: This policy shift impacts developer privacy and shapes the future of AI-powered coding tools.
- GitHub announced it will use Copilot interaction data, such as inputs, outputs, and code snippets, to train its AI models (How-To Geek, GitHub Blog).
- The change takes effect on April 24, and users who do not want their data used must opt out (How-To Geek).
- The generative AI models behind assistants such as Copilot were built on vast datasets, which is why continued data collection is seen as crucial for ongoing improvement (How-To Geek).
GitHub is updating its Copilot data usage policy, effective April 24, to use interaction data (inputs, outputs, and code snippets) for training its AI models. The move is detailed on the GitHub Blog and widely discussed on Hacker News. Because the change applies automatically unless users opt out, it raises questions about data privacy and the future of AI model development.

