Skip to content

Austin's Thoughts

Defense. Space. Technology. Straight Talk.

Menu
  • Books Read Over the Years
  • About Me
  • Contact
Menu
Big Data Sickness

Challenges of DoD AI/ML Path

Posted on August 14, 2024September 1, 2025 by Austin

Big Data Sickness

As the Depart­ment of Defense (DoD) con­tin­ues to explore dif­fer­ent use cas­es for inte­grat­ing Arti­fi­cial Intel­li­gence (AI) into its every­day process­es, it must learn from past lessons where AI deploy­ments failed. One such les­son is the fail­ure of IBM Wat­son deploy­ments. Since its debut on “Jeop­ardy!” IBM Wat­son has been applied to var­i­ous fields, includ­ing health­care, finance, cus­tomer ser­vice, and more, demon­strat­ing its ver­sa­til­i­ty and poten­tial to solve com­plex prob­lems across dif­fer­ent industries.

Data Cleaning

For IBM Wat­son to be effi­cient, it requires clean data to pro­duce pos­i­tive out­puts. Clean data refers to data that has been processed and pre­pared to ensure it is free from errors, incon­sis­ten­cies, and irrel­e­vant infor­ma­tion. Clean data is cru­cial for build­ing accu­rate and reli­able mod­els. How­ev­er, clean data is rare in the amount the DoD pro­duces because it often lacks uni­form metatags, fails to fol­low a par­tic­u­lar for­mat, and is stored in var­i­ous loca­tions like file stor­age or Microsoft Teams environments.

One rea­son IBM Wat­son deploy­ments failed was that the data need­ed to be cleaned, and the DoD need­ed trained data stew­ards who could add con­text to the data results. Devel­op­ing and train­ing good data stew­ards takes time, and at the height of the Glob­al War on Ter­ror­ism (GWOT), no one want­ed to give up their man­pow­er to clean up the data. Data clean­ing is also time-con­sum­ing and often gets del­e­gat­ed to junior enlist­ed per­son­nel who do not have a con­tex­tu­al under­stand­ing of the data.

Data Ownership

The next chal­lenge for a suc­cess­ful IBM Wat­son deploy­ment was get­ting access to the data. With­in a com­bat­ant com­mand, the senior rank­ing com­mand­ing offi­cer is typ­i­cal­ly the data own­er (often del­e­gat­ed to the senior com­mand­ing intel­li­gence offi­cer for respon­si­bil­i­ty, and the senior com­mu­ni­ca­tions offi­cer owns the sys­tems in which the data resides). How­ev­er, it can be chal­leng­ing. The indi­vid­ual pro­gram offices and direc­to­ries that spend their fund­ing to devel­op the sys­tems have con­trac­tu­al own­er­ship over the data.

For exam­ple, sup­pose the Logis­tics Direc­torate devel­ops a pro­gram to keep track of mate­r­i­al with­in their Area of Respon­si­bil­i­ty (AOR), and the direc­torate uti­lizes a Fed­er­al Ser­vice Inte­gra­tor (FSI) to devel­op the pro­gram. In that case, the data is now locked behind sev­er­al lay­ers of bureau­cra­cy. One, if the project is ongo­ing or if the devel­op­ment of the project has stopped, it will require con­tract mod­i­fi­ca­tion to allow for an Appli­ca­tion Pro­gram Inter­face (API) to access the under­ly­ing data or a new con­tract to devel­op new fea­tures with­in a lega­cy code base, which allows for the stat­ed API. Sec­ond­ly, it is often a bur­den­some process to review the lega­cy code base because the orig­i­nal devel­op­ment team, at the time of devel­op­ment, did not com­ment on their code, did not con­sid­er fea­ture sup­port, used pro­pri­etary tools that are no longer sup­port­ed, or com­plete­ly ven­dor-locked in.

Same Problem, New Technology

With the promise of the DoD CIO’s office “Cloud First” approach and its promise of Cloud Ser­vice Providers (CSPs) offer­ing unlim­it­ed access to all the AI good­ness, the DoD is falling into the same trap as it did with IBM Wat­son. For those unfa­mil­iar with the “Cloud First” approach, the DoD CIO’s “Cloud First” strate­gic ini­tia­tive is to mod­ern­ize the Depart­ment of Defense’s IT infra­struc­ture by pri­or­i­tiz­ing adopt­ing cloud com­put­ing tech­nolo­gies. This approach is designed to enhance the agili­ty, effi­cien­cy, and secu­ri­ty of DoD oper­a­tions. As the com­bat­ant com­mands and the ser­vices rushed toward this effort, they often need­ed to real­ize that there were a lot of hid­den costs with the “Cloud First” approach while still not address­ing the issues of data clean­ing and own­er­ship. What mag­ni­fies the issue fur­ther is the lack of spe­cial­ized tal­ent to imple­ment the desired cloud-agnos­tic approach because it requires tal­ent to have in-depth knowl­edge of each CSP to be effective.

“Cloud First” Challenge

The “Cloud First” Approach comes with many hid­den costs. One is the egress cost, or the charges incurred when data is trans­ferred from a cloud provider’s net­work to anoth­er net­work, such as the Inter­net or anoth­er cloud provider. Please note that there was lit­tle to no cost to ingest (data com­ing in) the infor­ma­tion, but the high egress (data going out) cost pre­vent­ed the com­bat­ant com­mands and ser­vices from being cloud-agnostic.

So, the trap is now set. With the promise of AI, the com­bat­ant com­mands and the ser­vices moved their file stor­age to a Cloud Ser­vice Provider (CSP) and, slow­ly, the oth­er parts of their IT infra­struc­ture. Mov­ing the file stor­age was done with lit­tle to no data cleanup, which led to undis­cov­ered data spillage on the exist­ing file stor­age, which now resides on the CSP or data cor­re­la­tion issue. A data cor­re­la­tion issue is where one or more data points can be cor­re­lat­ed to gain more insights. This cor­re­la­tion often clas­si­fies the data at a high­er clas­si­fi­ca­tion. To bypass the issue of data cor­re­la­tion, the com­bat­ant com­mands and ser­vices now over­clas­si­fy the data, which caus­es its own set of issues, pri­mar­i­ly high­er labor costs on per­son­nel to have the nec­es­sary secu­ri­ty clear­ance to access and ana­lyze the data.

Talent Challenge

The DoD fell into the trap of the amount of tal­ent that would be need­ed to main­tain these cloud infra­struc­tures. The DoD went in with the ini­tial thought process­es that they were going to be able to repur­pose the exist­ing tal­ent to main­tain these future sys­tems, but with­out real­iz­ing that there was still an ongo­ing war, which stepped into being able to do some deploy­ment cycles off time that is required to study for these cer­ti­fi­ca­tions. And there were still out­stand­ing sys­tems in place that were required to be able to main­tain while they were try­ing to go to this future cloud environment.

If stat­ed indi­vid­u­als do get trained, have the nec­es­sary cer­ti­fi­ca­tions, and have the secu­ri­ty clear­ance, they will strug­gle with the “Why” state­ment. Why would an indi­vid­ual grade of E‑4 through E‑6 pay or junior offi­cers stay in the ser­vice when their coun­ter­part in the civil­ian sec­tor ini­tial­ly makes between $75,000 to $140,000 with­out the addi­tion­al respon­si­bil­i­ties of serv­ing?  Many make the deci­sion to leave the mil­i­tary, which adds to the con­tin­ued gap of not hav­ing enough cleared per­son­nel with the nec­es­sary clear­ance for the sug­gest­ed AI/ML strat­e­gy or Cloud First Approach with­out hav­ing to sub­con­tract the work.  Along with the his­tor­i­cal­ly low recruit­ing num­bers, the gap con­tin­ues to grow.

Recommendations

The Depart­ment of Defense (DoD) is at a crit­i­cal cross­roads in pur­su­ing AI and ML inte­gra­tion. To nav­i­gate these chal­lenges effec­tive­ly, I pro­pose the fol­low­ing com­pre­hen­sive steps:

1. Mod­ern­ize Con­trac­tu­al Frame­works and Lega­cy Systems
To inte­grate AI and ML effec­tive­ly, the DoD must mod­ern­ize its con­trac­tu­al frame­works and lega­cy sys­tems. This involves con­duct­ing a com­pre­hen­sive review of exist­ing con­trac­tu­al lan­guage to iden­ti­fy mis­sion-crit­i­cal pro­grams. A sys­tem­at­ic process should be ini­ti­at­ed to update lega­cy code bases, pri­or­i­tiz­ing those essen­tial for AI/ML inte­gra­tion. New con­trac­tu­al tem­plates must also be devel­oped to facil­i­tate eas­i­er data access and shar­ing across dif­fer­ent DoD enti­ties, ensur­ing a more stream­lined and effi­cient approach to data man­age­ment and tech­no­log­i­cal advancement.

2. Imple­ment a Data-Cen­tric Approach
A data-cen­tric approach is cru­cial for the DoD’s AI and ML ini­tia­tives. This strat­e­gy begins with estab­lish­ing a robust data gov­er­nance frame­work pri­or­i­tiz­ing data qual­i­ty, secu­ri­ty, and inter­op­er­abil­i­ty across all sys­tems. Com­pre­hen­sive data audits should be con­duct­ed to thor­ough­ly under­stand exist­ing data assets, their qual­i­ty, and their poten­tial val­ue for AI/ML appli­ca­tions. Fur­ther­more, clear cri­te­ria must be devel­oped to guide deci­sions on which data sets should be migrat­ed to the cloud. These cri­te­ria should con­sid­er data sen­si­tiv­i­ty, oper­a­tional impor­tance, and the poten­tial for AI/ML uti­liza­tion, ensur­ing that cloud migra­tion efforts are strate­gic and aligned with the DoD’s broad­er tech­no­log­i­cal goals.

3. Stan­dard­ize Cloud Infrastructure
Stan­dard­iz­ing cloud infra­struc­ture is essen­tial for the DoD to max­i­mize effi­cien­cy and inter­op­er­abil­i­ty in its AI and ML ini­tia­tives. Although con­tracts have been award­ed to mul­ti­ple Cloud Ser­vice Providers (CSPs), focus­ing on a pri­ma­ry plat­form to stream­line oper­a­tions is cru­cial. Ama­zon Web Ser­vices (AWS) is the lead­ing can­di­date due to its high adop­tion rates and robust secu­ri­ty fea­tures. How­ev­er, this deci­sion should not be sta­t­ic; reg­u­lar reassess­ments should be con­duct­ed to ensure the cho­sen plat­form meets the DoD’s evolv­ing needs and keeps pace with tech­no­log­i­cal advance­ments. A com­pre­hen­sive migra­tion strat­e­gy must be devel­oped to facil­i­tate this stan­dard­iza­tion, encom­pass­ing detailed time­lines, resource allo­ca­tion plans, and risk mit­i­ga­tion strate­gies. This approach will ensure a smooth tran­si­tion to a stan­dard­ized cloud infra­struc­ture while min­i­miz­ing dis­rup­tions and secu­ri­ty risks.

4. Invest in Human Cap­i­tal and Knowl­edge Management
Invest­ing in human cap­i­tal and knowl­edge man­age­ment is crit­i­cal for the DoD’s suc­cess­ful imple­men­ta­tion of AI and ML tech­nolo­gies. This involves estab­lish­ing ded­i­cat­ed knowl­edge man­age­ment teams and data stew­ard­ship roles across the orga­ni­za­tion to ensure prop­er over­sight and uti­liza­tion of data assets. Com­pre­hen­sive train­ing pro­grams should be devel­oped to upskill exist­ing per­son­nel in cloud tech­nolo­gies, data man­age­ment, and AI/ML appli­ca­tions, cre­at­ing a work­force capa­ble of lever­ag­ing these advanced tech­nolo­gies effec­tive­ly. To address the chal­lenge of tal­ent reten­tion, the DoD must cre­ate attrac­tive career paths and imple­ment robust reten­tion strate­gies that can com­pete with pri­vate sec­tor oppor­tu­ni­ties. This approach will help build and main­tain a skilled work­force capa­ble of dri­ving the DoD’s tech­no­log­i­cal ini­tia­tives for­ward, ensur­ing that the orga­ni­za­tion can ful­ly cap­i­tal­ize on the poten­tial of AI and ML in its operations.

5. Enhance Resilience and Redundancy
Enhanc­ing resilience and redun­dan­cy is cru­cial for the Depart­ment of Defense (DoD) to ensure unin­ter­rupt­ed data access and com­mu­ni­ca­tion in the face of poten­tial threats to cloud access points (CAPs). The DoD must rec­og­nize that CAPs may become tar­gets, poten­tial­ly dis­rupt­ing crit­i­cal oper­a­tions for com­bat com­mands. To mit­i­gate this risk, robust redun­dan­cy mea­sures should be imple­ment­ed. This can include deploy­ing AWS Snow­ball or sim­i­lar edge com­put­ing solu­tions at com­bat com­mands, enabling con­tin­ued data access and deci­sion-mak­ing capa­bil­i­ties even under Title 10 author­i­ty. Addi­tion­al­ly, the DoD should devel­op and reg­u­lar­ly test con­tin­gency plans to pre­pare for sce­nar­ios where cloud access is com­pro­mised. By imple­ment­ing these mea­sures, the DoD can enhance its abil­i­ty to main­tain oper­a­tional effec­tive­ness and ensure the avail­abil­i­ty of crit­i­cal data and com­mu­ni­ca­tion chan­nels in the face of poten­tial disruptions.

6. Real­is­tic Cost-Ben­e­fit Analysis
A real­is­tic cost-ben­e­fit analy­sis is essen­tial for the DoD’s AI and ML ini­tia­tives, mov­ing beyond the sim­plis­tic view of cloud migra­tion as mere­ly a cost-sav­ing mea­sure. This approach requires thor­ough, long-term cost-ben­e­fit analy­ses that account for often-over­looked expens­es such as data egress fees, ongo­ing train­ing require­ments, and sys­tem updates. These com­pre­hen­sive assess­ments should con­sid­er both imme­di­ate and future finan­cial impli­ca­tions, as well as oper­a­tional ben­e­fits. Addi­tion­al­ly, it’s cru­cial to reg­u­lar­ly eval­u­ate and report on the tan­gi­ble ben­e­fits and return on invest­ment (ROI) of AI/ML ini­tia­tives. This con­tin­u­ous assess­ment process not only jus­ti­fies con­tin­ued invest­ment but also helps iden­ti­fy areas for improve­ment and opti­miza­tion. By adopt­ing this more nuanced and holis­tic approach to finan­cial analy­sis, the DoD can make more informed deci­sions about resource allo­ca­tion and ensure that its AI and ML invest­ments deliv­er real, mea­sur­able val­ue to its oper­a­tions and over­all mission.

7. Align AI/ML Ini­tia­tives with Warfight­er Needs
Align­ing AI/ML ini­tia­tives with warfight­er needs is cru­cial for the DoD to max­i­mize the impact of these tech­nolo­gies on oper­a­tional effec­tive­ness. This align­ment requires estab­lish­ing clear and direct lines of com­mu­ni­ca­tion between AI/ML devel­op­ment teams and front­line mil­i­tary per­son­nel, ensur­ing that tech­no­log­i­cal solu­tions are tai­lored to real-world oper­a­tional require­ments. The DoD should pri­or­i­tize AI/ML projects that direct­ly enhance mis­sion capa­bil­i­ties and oper­a­tional effec­tive­ness, focus­ing resources on ini­tia­tives that pro­vide tan­gi­ble ben­e­fits to warfight­ers. To mea­sure the suc­cess of these efforts, it’s essen­tial to devel­op com­pre­hen­sive met­rics that assess the impact of AI/ML inte­gra­tion on mis­sion suc­cess and warfight­er effec­tive­ness. These met­rics should go beyond tech­ni­cal per­for­mance to include prac­ti­cal out­comes in the field. By main­tain­ing this close align­ment between tech­no­log­i­cal devel­op­ment and oper­a­tional needs, the DoD can ensure that its AI/ML ini­tia­tives deliv­er mean­ing­ful improve­ments to mil­i­tary capa­bil­i­ties and con­tribute sig­nif­i­cant­ly to over­all mis­sion success.

Conclusion

The Depart­ment of Defense’s jour­ney towards effec­tive AI and ML inte­gra­tion is com­plex and mul­ti­fac­eted, requir­ing a com­pre­hen­sive and strate­gic approach. The lessons learned from past fail­ures, such as the IBM Wat­son deploy­ments, high­light the crit­i­cal impor­tance of address­ing fun­da­men­tal issues before rush­ing into new tech­no­log­i­cal initiatives.

The DoD faces sig­nif­i­cant chal­lenges in data clean­ing, data own­er­ship, cloud infra­struc­ture stan­dard­iza­tion, and tal­ent reten­tion. These chal­lenges are com­pound­ed by the rapid pace of tech­no­log­i­cal advance­ment and the com­pet­i­tive land­scape for skilled per­son­nel. How­ev­er, by imple­ment­ing the rec­om­mend­ed steps, the DoD can build a sol­id foun­da­tion for suc­cess­ful AI and ML integration.

The key to this suc­cess is a shift towards a data-cen­tric approach that empha­sizes the impor­tance of clean, acces­si­ble, and prop­er­ly man­aged data. This must be cou­pled with mod­ern­ized con­trac­tu­al frame­works and lega­cy sys­tems that facil­i­tate rather than hin­der tech­no­log­i­cal advance­ment. Stan­dard­iz­ing cloud infra­struc­ture while main­tain­ing flex­i­bil­i­ty for future needs is cru­cial for effi­cien­cy and interoperability.

Invest­ing in human cap­i­tal through com­pre­hen­sive train­ing pro­grams and attrac­tive career paths is essen­tial for build­ing and retain­ing the nec­es­sary tal­ent pool. This invest­ment must be bal­anced with real­is­tic cost-ben­e­fit analy­ses that con­sid­er both the imme­di­ate and long-term impli­ca­tions of AI and ML initiatives.

Per­haps most impor­tant­ly, the DoD must ensure its AI and ML ini­tia­tives align with warfight­er needs. This align­ment will ensure that tech­no­log­i­cal advance­ments trans­late into tan­gi­ble improve­ments in oper­a­tional effec­tive­ness and mis­sion success.

The path for­ward requires patience, strate­gic think­ing, and a will­ing­ness to address sys­temic issues. It demands a bal­ance between embrac­ing cut­ting-edge tech­nolo­gies and main­tain­ing the robust­ness and secu­ri­ty required for mil­i­tary oper­a­tions. By adopt­ing this com­pre­hen­sive approach, the DoD can avoid the pit­falls of past ini­tia­tives and build an AI and ML ecosys­tem that tru­ly enhances its oper­a­tional capabilities.

Ulti­mate­ly, the suc­cess­ful inte­gra­tion of AI and ML tech­nolo­gies has the poten­tial to sig­nif­i­cant­ly enhance the DoD’s abil­i­ty to meet cur­rent and future chal­lenges. How­ev­er, this poten­tial can only be real­ized through care­ful plan­ning, strate­gic imple­men­ta­tion, and a con­tin­u­ous com­mit­ment to learn­ing and adap­ta­tion. As the DoD moves for­ward, it must remain focused on its core mis­sion while lever­ag­ing these tech­nolo­gies to cre­ate a more agile, effi­cient, and effec­tive defense force for the future.

Like this:

Like Load­ing…

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

  • Books Read Over the Years
  • About Me
  • Contact

Archives

  • April 2026
  • March 2026
  • February 2026
  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • April 2025
  • March 2025
  • February 2025
  • January 2025
  • November 2024
  • October 2024
  • August 2024
  • June 2024
  • May 2024
  • January 2024
  • August 2023
  • June 2023
  • January 2023
  • November 2022
  • October 2022
  • August 2022
  • May 2022
  • November 2016
  • November 2015
  • October 2015
  • April 2015
  • July 2013
  • March 2013
  • February 2013
  • January 2013
  • August 2012
  • April 2012
  • March 2012
  • February 2012
  • August 2011
© 2026 Austin's Thoughts | Powered by Superbs Personal Blog theme
Manage Cookie Consent
We use cookies to optimize our website and our service.
Functional Always active
The technical storage or access is strictly necessary for the legitimate purpose of enabling the use of a specific service explicitly requested by the subscriber or user, or for the sole purpose of carrying out the transmission of a communication over an electronic communications network.
Preferences
The technical storage or access is necessary for the legitimate purpose of storing preferences that are not requested by the subscriber or user.
Statistics
The technical storage or access that is used exclusively for statistical purposes. The technical storage or access that is used exclusively for anonymous statistical purposes. Without a subpoena, voluntary compliance on the part of your Internet Service Provider, or additional records from a third party, information stored or retrieved for this purpose alone cannot usually be used to identify you.
Marketing
The technical storage or access is required to create user profiles to send advertising, or to track the user on a website or across several websites for similar marketing purposes.
  • Manage options
  • Manage services
  • Manage {vendor_count} vendors
  • Read more about these purposes
Preferences
  • {title}
  • {title}
  • {title}
%d