Difference between revisions of "Campagne de recrutement de scientifiques des données - Page d'information pour les candidats/Data Science GC"

From wiki
Jump to navigation Jump to search
Line 16: Line 16:
  
 
You can also see other [https://www.statcan.gc.ca/eng/data-science/projects Data Science Projects] from the agency.
 
You can also see other [https://www.statcan.gc.ca/eng/data-science/projects Data Science Projects] from the agency.
 +
 +
<nowiki>*</nowiki>Disclamer: questions pertaining to the process specific to data science can be addressed during the information sessions

Revision as of 18:04, 19 October 2022

Data Science in the Government of Canada

What is Data Science?

Data Science is an interdisciplinary field that uses scientific methods and algorithms to extract information and insights from diverse data types. It combines domain expertise, programming skills and knowledge of mathematics and statistics to solve analytically complex problems.

You can can get a short overview of the daily work of a data scientist in the Government of Canada by watching this video.

The COVID-19 Pandemic: A Stark Reminder of the Crucial Role of Data Scientists

Data science allows government agencies and department to respond quickly to changing economic and social situations. For example, Statistics Canada is using the power of data science to support the COVID-19 response in Canada.

The agency collaborated with Health Canada to visualize the supply and demand information for Personal Protective Equipment (PPE). Before the data visualization could begin, the data needed to be extracted and ingested. The data were coming daily from many different sources (different provincial/territorial governments, other federal departments and private sector companies that had been hired to help source the PPE) and in many different formats (e.g. Word documents, Excel files, PDFs) and required a significant amount of manual work to create standardized reports.

To improve this process, data scientists at Statistics Canada created an algorithm that parses the data into different data entries. Machine learning was used to identify numbers and dates within the text. The structured data were then presented in a PowerBI dashboard that was shared with other government departments to meet their information needs and better understand the supply and demand for PPE in Canada.

For more information on Statistics Canada's response to COVID-19, visit the COVID-19: A data perspective portal.

You can also see other Data Science Projects from the agency.

*Disclamer: questions pertaining to the process specific to data science can be addressed during the information sessions