Posted on: Thu Oct 12 2023
Image Credit: Lerbank-bbk22
This issue of Data Delves covers our biweekly roundup of the top 10 data-related news and articles from September 29, 2023 to October 12, 2023.
1. New data shows that without intervention, generative AI jobs will continue to cluster in the same big tech hubs
The article discusses the concentration of generative AI jobs in just a few key hubs, despite the belief that open-source software would allow anyone to compete. The winner-take-most dynamics of digital technologies and the mechanics of AI itself cut against decentralization, at least among cities. Over 60% of generative AI jobs posted in the year ending in July 2023 were clustered in just 10 metro areas, many of which are the same large coastal hubs that AI optimists believe the technology should diffuse out of. The article suggests that policymakers and technologists need to take into account the costs, technical talent, and computing hurdles that tend to locate the building and training of foundation models in frontier hubs. The article proposes several potential federal, state, and local actions to promote a more inclusive AI geography, including expanding the National Science Foundation’s National Artificial Intelligence Research Institutes program and establishing the proposed National AI Research Resource (NAIRR) to democratize access to essential data and computational resources.
Read full article: https://www.brookings.edu/articles/new-data-shows-that-without-intervention-generative-ai-jobs-will-continue-to-cluster-in-the-same-big-tech-hubs/
2. WiMi Proposed Intelligent Travel Recommendation System Based on Big Data Analytics Modeling
WiMi Hologram Cloud Inc. has proposed an intelligent personalized travel recommendation system based on an augmented big data analytics model. The system incorporates user preferences, dynamic environments, desired activities, lifestyle experiences, and real-world problems to identify and recommend the most appropriate set of destinations for travel. The system uses deep learning, natural language processing, data mining, social media analysis, and GIS to synthesize various data sources and provide travelers with personalized and context-driven recommendations. The system also provides real-time updates and a user participation mechanism to optimize the recommendation algorithm and improve the accuracy and user satisfaction of the system. The development of this travel recommendation system is based on advanced technology and intelligent analysis that offers several key benefits, including personalized recommendations, context-driven recommendations, multi-source data integration, and real-time updates.
Read full article: https://en.prnasia.com/releases/global/wimi-proposed-intelligent-travel-recommendation-system-based-on-big-data-analytics-modeling-420846.shtml
3. Visa Launches $100 million Generative AI Ventures Initiative
Visa, a global leader in payments, has announced a new $100 million initiative called generative AI ventures. The initiative aims to invest in the next generation of companies that are focused on developing generative AI technologies and applications, which will have an impact on the future of commerce and payments. Generative AI is a subset of artificial intelligence that utilizes large language models to create artificial general intelligence capable of generating text, images, or other content based on given prompts. Visa, being a pioneer in AI use in payments since 1993, sees this initiative as an extension of their leadership in using AI to drive payment innovation, create value for partners and clients, and enable global commerce. The initiative will be led by Visa Ventures, the corporate investment arm of Visa, which has been investing in and partnering with innovative payment and commerce companies since 2007. The goal is to invest in innovative and disruptive startups that are working on generative AI, commerce, and payments.
Read full article: https://investor.visa.com/news/news-details/2023/Visa-Launches-100-million-Generative-AI-Ventures-Initiative/default.aspx
4. Chaos LakeDB: First Lake Database for Live Search, SQL, and Gen AI Analytics
ChaosSearch has released Chaos LakeDB, the first data lake database designed to power generative artificial intelligence (AI), SQL, and Live Search. The database is available as a SaaS data platform for enterprises and as an embedded database from cloud platform providers. Chaos LakeDB is designed for various applications including general system observability, understanding product usage trends to derive business insights, and search log data for troubleshooting. It helps simplify architecture by removing the need for complex data movement or data pipelines. In addition, it consolidates diverse data streams and formats into one cohesive data lake database. The database also has the capability to automate the data pipeline, orchestrate workload across the lake backbone, and schema management. The integration with Amazon Simple Storage Service (S3) allows users to merge vast storage capabilities with the accessibility of cloud databases and helps minimize the need for complex ETL and ELT processes. The database is already being used by industry leaders such as Cisco and Equifax.
Read full article: https://www.datanami.com/2023/10/12/unveiling-chaos-lakedb-first-lake-database-for-live-search-sql-and-gen-ai-analytics/
5. Leveraging Generative AI and Data Analytics for Energy Efficiency and Reduced Emissions in China
The article discusses how automation, electrification, and digitalization are being used to meet the demands for sustainability and energy security in China. ABB Group, a Swiss-Swedish multinational corporation, is committed to advancing technological solutions that cater to the increasing global energy demand while contributing to reduced emissions for each unit of energy added. ABB has been using artificial intelligence as a tool for data analytics and looking at the new world of possibilities opened by generative AI. The company is supporting customers in China to help them unlock insights hidden in operational data for better decisions and significant gains in productivity, cuts in energy consumption, and lower maintenance costs. The Shanghai Free Trade Zone has created an improving business environment for multinational companies and has been one of the elements to open the Chinese market.
Read full article: https://www.chinadaily.com.cn/a/202310/10/WS6524ab29a31090682a5e7aa3.html
6. Espionage fuels global cyberattacks: AI creates new threats – and new opportunities for defense
The fourth annual Microsoft Digital Defense Report covers trends between July 2022 and June 2023 across nation-state activity, cybercrime, and defense techniques. The report shows that cyberattacks have touched 120 countries, fueled by government-sponsored spying and with influence operations (IO) also rising. The predominant motivation has swung back to a desire to steal information, covertly monitor communication, or to manipulate what people read. The report also highlights the increase in the global scope of attacks, particularly in the Global South, especially Latin America and sub-Saharan Africa. The report also discusses the trends by nation-state, including Russia targeting Ukraine's NATO allies, China targeting US defense, South China Sea nations and Belt and Road Initiative partners, and Iran bringing new attacks to Africa, Latin America, and Asia. The report also discusses the use of AI as a weapon to refine phishing messages and improve influence operations with synthetic imagery, and the need for responsible AI practices crucial for maintaining user trust and privacy. The report concludes that the only secure defense will be a collective defense.
Read full article https://blogs.microsoft.com/on-the-issues/2023/10/05/microsoft-digital-defense-report-2023-global-cyberattacks/
7. Scalable Infrastructure Supporting Reproducible Nationwide Healthcare Data Analysis toward FAIR Stewardship
This paper describes the process of standardizing a common data model and developing a national data infrastructure using national claims data in South Korea. The authors established an Observational Medical Outcome Partnership (OMOP) common data model database for national claims data of the Health Insurance Review and Assessment Service of South Korea. They built a distributed data analysis environment and released metadata based on the FAIR principle. The metadata was made publicly available, and the authors argue that disclosure of this infrastructure to researchers will help to eliminate information inequality and contribute to the generation of high-quality medical evidence.
Read full paper: https://www.nature.com/articles/s41597-023-02580-7
8. PGIM real estate arm launches AI research lab
PGIM Real Estate, the real estate division of asset manager PGIM, has launched an innovation lab called RealAssetX in partnership with University College London and other institutions. The lab will utilize data from PGIM Real Estate and third parties to develop new technologies for the real estate sector. Additionally, PGIM Real Estate will collaborate with UCL to establish the Centre for Sustainability and RealTech Innovation, focusing on research in sustainability, artificial intelligence, and other advancements. Partnerships with the University of Chicago Data Science Institute, the National University of Singapore, and the University of New South Wales will also explore big data analysis, AI, and research data for predictive analytics, as well as sustainability and innovation. The aim is to advance the industry's progress towards a net-zero target by 2050. RealAssetX will work with Taronga Ventures to enable third-party capital investment in technology companies.
Read full article: https://www.fnlondon.com/articles/pgim-real-estate-arm-launches-research-lab-ai-tech-ucl-20231010
9. Countering Disinformation: Scientists Brace for Data Challenges as Social Media Giants Limit Data Access
The article discusses the upcoming high-profile elections around the world and the role of social media in providing information to voters. Researchers who study elections are concerned about the lack of free access to social media data, particularly from Twitter, which has discontinued its policy of giving researchers special access to its data. The article highlights the importance of independent researchers having access to online data to study their impact on society, including concerns about mental health, harassment, privacy violations, and hate speech. The EU's Digital Services Act, which is due to apply from early next year, should ensure that very large online platforms open up relevant data to vetted researchers deemed to be independent of commercial interests. However, researchers need to work with policymakers to define the procedures for determining which data risk divulging trade secrets, how best to ensure equitable access to data, and how data quality should be assessed, so that disputes can be quickly resolved.
Read full article: https://www.nature.com/articles/d41586-023-03102-6
10. Natural language processing analysis of the psychosocial stressors of mental health disorders during the pandemic
This paper discusses the impact of the COVID-19 pandemic on mental health, and the use of natural language processing (NLP) to analyze chat text from a mental health helpline. The study found that conversations involving themes like self-image and emotional crisis played a role in explaining both suicidal behavior and depressive symptoms, while anxious symptoms could only be explained by emotional crisis themes. The authors argue that helplines could provide a higher-quality alternative to social media data for large-scale mental health research, and open up new venues for computational analysis.
Read full paper: https://www.nature.com/articles/s44184-023-00039-6
In the Media: 香港經濟日報HKET
In this webinar, Dr. Chao He demonstrates the game-changing potential of artificial intelligence and natural language processing (NLP) in empowering financial institutions to proactively identify and mitigate risks, make informed decisions, and seize opportunities ahead of the curve.
This research examines sentiment contagion in social media, revealing its influence and providing valuable insights into how exposure content and individual heterogeneity impact the contagious spread of sentiments.