Abstract
The introduction and increasing popularity of artificial intelligence (AI) and machine learning (ML) technologies allow organizations to gain valuable insights from their copious amounts of data. However, legacy organizations often struggle to overcome outdated data management practices and unleash the potential of AI and ML on their data. There is simply too much data to sift through manually. Therefore, a data science tool is required to locate relevant information effectively within an organizations’ data lake. This paper presents a survey of challenges government organizations face related to this data discovery issue. The challenges are ubiquitous across mission sets, covering human resources and personnel management, logistics and supply chains, fraud and predatory business detection, government procurement, and civil litigation. This paper introduces the Data Discovery by Example (DICE) system to alleviate this problem. Unlike traditional data discovery techniques to find data of interest within a data lake, DICE alleviates the need to write queries and does not require users to manually inspect the lake to find their data of interest. Lastly, we walk through a DICE example, where we apply the tool on data from complaints of predatory business practices. This example highlights the challenges of acquiring, accessing, and interpreting data across multiple agencies and functions. There is an opportunity for innovative, ML-enabled data discovery solutions, such as DICE, to help unlock the value of data and augment development of a modern government organization.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Newman, J.: Toward AI security: global aspirations for a more resilient future. University of California, Berkeley, Center for Long-Term Cybersecurity, 21, 24, 30, 34 (2019)
Koumpan, E.: 13 Tips & techniques to use when visualizing data. benelux intelligence community. https://www.bi-kring.nl/component/easytagcloud/293-module/120-data-analytics. Accessed 25 Sept 2022
Vrontis, D., et al.: Artificial intelligence, robotics, advanced technologies and human resource management: a systematic review. Int. J. Hum. Resour. Manag. 33(6), 1237–1266 (2021)
United States air force. https://www.af.mil/About-Us/. Accessed 25 Sept 2022
Gross, C.: MilPDS upgrade successful. https://www.buckley.spaceforce.mil/News/Article-Display/Article/322518/milpds-upgrade-successful/. Accessed 28 Sept 2022
Air force personnel center. https://www.afpc.af.mil/Career-Management/. Accessed 25 Sept 2022
Secretary of the air force public affairs: DAF to launch myEval in 2022. https://www.af.mil/News/Article-Display/Article/2892242/daf-to-launch-myeval-in-2022/. Accessed 28 Sept 2022
Department of defense budget fiscal year 2023. https://comptroller.defense.gov/Portals/45/Documents/defbudget/FY2023/FY2023_p1.pdf. Accessed 25 Sept 2022
Business and enterprise systems directorate: modern logistics system aids, tracks air force inventory. Air force. https://www.af.mil/News/Article-Display/Article/2891103/modern-logistics-system-aids-tracks-air-force-inventory/. Accessed Accessed 25 Sept 2022
F-35 Joint Program Office: F-35 Joint Program Office completes initial deployment of new, improved logistics hardware. Edwards Air Force Base. https://www.edwards.af.mil/News/Article/2919347/f-35-joint-program-office-completes-initial-deployment-of-new-improved-logistic/ Accessed 25 Sept 2022
Defense Property Accountability System https://dpassupport.golearnportal.org/index.php/system-solutions-dpas/by-business-need/140-dpas-overview Accessed 25 Sept 2022
West, D.M.: Using AI and machine learning to reduce government fraud. Brookings, https://www.brookings.edu/research/using-ai-and-machine-learning-to-reduce-government-fraud/. Accessed 29 Sept 2022
Meister, L., et al.: Servicemember financial protection: a overview of key federal laws and regulations. Consum. Compliance Outlook 2, 10–19 (2017)
Air Force Instruction 51–304, Legal Assistance, Notary, Preventive Law, and Tax Programs (2018)
CFR § 631.11, Off-limits establishments and areas
Rezig, E., et al.: Examples are all you need: iterative data discovery by example in data lakes. In: 12th Annual Conference on Innovative Data Systems Research, Chaminade, USA (2022)
Girth, A. M., Snider, K. F.: Acquisition in U.S. federal agencies: evidence from the world’s largest buyer. J. Strateg. Contracting Negot. 4(1–2), 3–5 (2020)
U.S.C. § 3306(a)(1)(B) (2018)
U.S.C. § 3101 et seq. (2018)
U.S.C. §§ 3702(b), 3703(c) (2018)
Federal Acquisition Regulation 9.104–1
Tillipman, J.: Using AI to Reduce Performance Risk in U.S. Procurement. The Regulatory Review (2022). https://www.theregreview.org/2022/06/29/tillipman-using-ai-to-reduce-performance-risk-in-u-s-procurement/
Federal Acquisition Regulation 9.103(b)
Contractor Performance Assessment Reporting System. https://cpars.gov/. Accessed 30 Sept 2022
Electronic Discovery Reference Model. https://edrm.net/edrm-model/. Accessed 30 Sept 2022
Acknowledgements
The authors wish to acknowledge the following individuals for their contributions and support: Bob Bond, Jeremy Kepner, Tucker Hamilton, Garry Floyd, Mike Kanaan, Tim Kraska, Charles Leiserson, Christian Prothmann, John Radovan, Daniela Rus, Allan Vanterpool, Ben Price, Michael Stonebraker, Anshul Bhandari, Tameka Collier, Renee Collier, Robert Preston, C. Taylor Smith, James Hanley, Ryan Holte, and Kate Reece. This research was sponsored by the United States Air Force Research Laboratory and the United States Air Force Artificial Intelligence Accelerator and was accomplished under Cooperative Agreement Number FA8750–19-2–1000. The views and conclusions contained in this document are those of the authors and should not be interpreted as representing the official policies, either expressed or implied, of the United States Air Force or the U.S. Government. The U.S. Government is authorized to reproduce and distribute reprints for Government purposes notwithstanding any copyright notation herein.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Bowne, A., McEvoy, L., Gupta, D., Brown, C., Gadepally, V., Rezig, E.K. (2022). A Survey of Data Challenges Across a Modernizing Bureaucracy: A New Perspective on Examining Old Government Problems. In: Rezig, E.K., et al. Heterogeneous Data Management, Polystores, and Analytics for Healthcare. DMAH Poly 2022 2022. Lecture Notes in Computer Science, vol 13814. Springer, Cham. https://doi.org/10.1007/978-3-031-23905-2_2
Download citation
DOI: https://doi.org/10.1007/978-3-031-23905-2_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-23904-5
Online ISBN: 978-3-031-23905-2
eBook Packages: Computer ScienceComputer Science (R0)