|
Data Science Updates is the University of Wisconsin-Madison's resource for news, training, events, and professional opportunities in data science, brought to you by the Data Science Institute, powered by American Family Insurance, and the Data Science Hub.
September 4, 2024
|
|
|
|
Industry Leaders Will Discuss Open Source and AI at Innovate Week Event
The Open Source Program Office (OSPO) and the Data Science Institute (DSI) invite you to our upcoming Innovate Week event, How Open Source is Fostering Innovation in AI, September 26 at the Fluno Center. Industry leaders will present and discuss the transformative role of open-source technologies in advancing the AI landscape:
- Paige Bailey, Google
- Will Falcon, Lightning AI
- Alex Mashrabov, Higgsfield AI and AI Factory
- Nhi Lê (Moderator), Alpha Intelligence Capital
The event will begin with a reception at 5pm, with the presentations starting at 5:30. Details and registration are available here.
|
|
|
|
New Websites Connect You with Data Science Resources at UW
|
|
Unleash Your Creativity: Apply to Compete at MadPrompts!
|
|
|
|
Intro to Shiny Apps - R Focus
September 4, 1:00 p.m. - 3:00 p.m.; Zoom. Posit’s Ryan Johnson will review the basics of a Shiny App using the R programming Language. Next month, Posit Day will focus on Shiny Apps using Python. During the Researcher Spotlight, our guest, Elissa Chasen, PhD, will talk about how they use data science tool to further their research. Learn more and register for today's Shiny apps session for free!
|
Research Cyberinfrastructure Virtual Office Hours
September 9, 1:00 p.m. - 2:00 p.m.; Zoom. Join DoIT's Research Cyberinfrastructure during their weekly office hours to ask any questions you have regarding cyberinfrastructure. The team specializes in cloud computing, LabArchives, ResearchDrive, and related secure platforms. For more information and access to the Zoom link, visit the office hours calendar listing.
|
SSCC Workshops
The Social Science Computing Cooperative (SSCC) is hosting a variety of workshops this month. Topics, dates & times, locations, and event links are below. Introductory workshops are designed to be taken before and in conjunction with the Data Wrangling workshops. Learn more about the SSCC at the SSCC website.
|
|
Join the 2024 Machine Learning Marathon!
September 12, 5:30-7:30 p.m.; Rm 1240, Computer Sciences. In collaboration with MadAI and the Open Source Program Office, the ML+X community is thrilled to announce the return of its annual hackathon event — the 2024 Machine Learning Marathon! Running from September 12 to December 12, this 12-week marathon offers an opportunity for both beginner and expert machine learning practitioners to collaborate, learn, and innovate on real-world datasets through various projects hosted on Kaggle. Many of the projects reflect similar datasets and problems faced by researchers here at UW-Madison! Participants will form small teams of 2-5 members, dedicating at least 3 hours/week to complete their chosen challenge, with the potential to win prizes. Weekly "sprint" events will facilitate knowledge exchange and discussion on ML tools and strategies, and each team will be assigned an advisor to help them throughout the event. To learn more and register by 9/10, visit the 2024 Machine Learning Marathon webpage.
|
Registration for Mini-Workshop Series
Registration is open for the Data Science Hub's Fall 2024 Mini-Workshop Series. The Mini-Workshop Series are one-to-two day workshops throughout the Fall covering a range of topics listed below. All workshops take place online from 9am-1pm. Register for any and all that you are interested in. Tickets close the Friday before each mini-workshop. To learn more and register, visit the mini-workshop event page.
|
Topic |
Date & Time |
Location |
Intro to git and GitLab |
September 25 9:00 a.m. - 1:00 p.m. |
Zoom |
Intro to Docker |
October 23-24 9:00 a.m. - 1:00 p.m. |
Zoom |
Interactive Data Visualizations in Python and Streamlit |
November 6 9:00 a.m. - 1:00 p.m. |
Zoom |
Intro to Machine Learning with Sklearn |
November 20-21 9:00 a.m. - 1:00 p.m. |
Zoom |
|
Workshop Series: Python and R Programming Languages for Data Analysis
|
Library Micro-Courses
Online, Asynchronous
The UW Library system offers online, asynchronous micro-courses that cover topics often not taught in classrooms. Current courses discuss intellectual property, responsible data planning, patents, grants, data management, and social justice. These courses are short, interactive and available to be completed at your own pace. For more information and access to the full course list, visit the library micro-course website.
|
|
|
|
Have questions about anything data science-related? Come see the Data Science Hub facilitators at Coding Meetup on Thursdays from 2:30-4:30 p.m. CT. To join Coding Meetup, join data-science-hubgroup.slack.com
|
|
|
|
SILO Seminar
TODAY September 4, 12:30 p.m.; Discovery Building, Orchard View + Zoom. Come join the SILO group in welcoming Dr. David Inouye of Purdue University for their first seminar of the semester. In his presentation titled "Domain Counterfactuals for Explainability, Fairness, and Domain Generalization," Inouye aims to discuss domain counterfactuals and their potential impact on distribution matching and building machine learning models.
|
Genomic Seminar Series
TOMORROW September 5, 1:30 p.m., 425 Henry Mall Room 1111 + Zoom. Join the Center for Genomic Science Innovation in welcoming Barak Cohen of Washington University for his talk titled "Integration of cis-regulatory information in the genome." His research focuses on predicting the effects of genetic variants on gene expression. This seminar series occurs on the first Thursday of every month. For more information and the Zoom link, visit the seminar's calendar link.
|
|
|
|
Navigating and Contributing to ML+X Nexus: Crowdsourced ML and AI Resources
September 10, 12:00 p.m.; Zoom + Discovery Building, Orchard View. The ML+X community's monthly forum returns starting September 10th! During the kickoff, we will discuss upcoming community events and properly introduce the new ML+X Nexus platform we have been developing in the community, including a short demo on how to contribute! Register for the ML+X kickoff by September 8th (lunch provided) to guarantee your lunch ticket and join the discussion!
About ML+X Nexus
ML+X Nexus is the community’s centralized hub for sharing machine learning (ML) resources, designed to make the practice of ML more connected, efficient, accessible, and reproducible. It is intended to host a wide range of resources (original or external), including educational materials, code for common ML procedures, pretrained and foundational model use guides, recorded talks, and more. While practitioners can use Nexus to expand their expertise, educators and PIs will find it useful for sharing ML knowledge and procedures across fields, reducing redundancy, and improving learning outcomes. In this session, you'll learn how to navigate and contribute to Nexus, which already includes resources like the Intro to Deep Learning with Keras/PyTorch, a library of our recorded forums (other talks are also welcome!), guides on best reproducibility practices, and more. We’ll also discuss future areas of development, such as our member blog section, contrastive learning and trustworthy AI, and how you can get involved through our Nexus GitHub. Don’t miss this chance to connect with the community and make your work more impactful!
|
ML+Coffee: How Can I Apply Machine Learning to My Data?
September 11, 9:00 a.m. - 11:00 a.m.; Rm. 1145, Discovery Building. As part of the ML+X community's monthly ML+Coffee event, researchers and students with little or no background in machine learning (ML) are invited to join and ask how ML can be applied in their domain of work. ML+Coffee offers a casual and social atmosphere where ML practitioners can problem-solve with one another. If interested, please contact the ML+X leadership team with a short description of the problem and dataset. For additional context, check out some of the previous projects discussed at ML+Coffee.
|
Help Build Community Around Applied Machine Learning
The ML+X community is seeking additional members to join its leadership team! Members of the leadership team will have the opportunity to help grow and sustain a lively and engaged machine learning (ML) community of practice, ensuring that practitioners across campus have ample opportunities to discuss modeling challenges, learn from one another, and support each other’s work. Anyone who is passionate about ML and communal learning (including students!) is welcome to join the leadership team.
— No minimum experience in ML is required. We are particularly seeking increasing representation from the Digital Humanities, Psychology, Math, Political Science, Chemistry, and more. If interested, please fill out the ML+X leadership application form. Contact endemann@wisc.edu if you have any questions or require additional information.
|
|
|
|
|
PROFESSIONAL
|
Data, Risk Management, and Grants Facilitator
Apply by September 8 – The UW Madison School of Business is seeking a Facilitator to assist Business faculty and graduate students in locating both data sources and grant opportunities. This role involves creating and renewing databases of available resources. It is critical the person in this role adheres to data security and risk management guidelines.
Requirements
- Bachelor's degree
- Experience with computer systems and grant funding processes
- Preferred experience with risk assessment
Responsibilities
- Develop and enhance current data policies
- Work collaboratively with PIs to ensure institutional compliance
- Simplify the funding process by researching and managing grants
Read the full position description and apply at the job posting on the Jobs at UW portal.
|
STUDENT
|
WID Administrative Student Assistant
Apply by October 1 - The Data Science Hub, housed in the Wisconsin Institute for Discovery, is searching for a student assistant to help with administrative tasks. Position includes campus errands, assisting with events, and collaborating on the biweekly Data Science Updates newsletter.
Requirements
- Be adaptable, organized, and willing to grow in the role
- Current undergraduate student at UW majoring in data science, computer science, or related field
Position Summary
- Up to 15 hours/week, remote and in-person
- $15 per hour
Read the full position description and apply at the job posting on the student jobs website.
|
|
|
|
|
DATA VISUALIZATION OF THE WEEK
|
|
|
|
Decline in Data for AI Bots to Scrape: Trends for Robots.txt and Terms of Service ( paper) Data Provenance Initiative.
Many AI systems are built and trained on public web data. However, this study conducted via the Data Provenance Initiative discovered that there has been a rapid increase in data access restrictions in recent years. These data protections greatly decrease the available data on the open web that is available to commercial and private AI, leading to biases.
|
|
|
|
Data Science Updates is a collaborative effort of the Data Science Institute and Data Science Hub.
Use our submission form to send us your news, events, opportunities and data visualizations for future issues.
|
|
|
|
|