Session Summaries by Emilie-Furtado

written by: — October 11, 2024

Data, Metadata and Tropy, 25-09-2024

During the session, we learned about and discussed the use of data and metadata. We learned that data is not only something for computer scientists and mathematicians, but that it has a place in many industries and in our private lives due to digitalization. Big data is becoming an integral part of historical sources. We have taken a closer look at what data and metadata are, where they come from and how they can become a historian’s tool for scientific research. Furthermore, we have defined that data is something that anyone can create, record, store, observe and analyze. It is very easily accessible and is becoming increasingly widespread through digitization. Metadata is used to describe data. It is created automatically by technology, but can be enhanced by a person or a group. We got the explanation visually with the help of a YouTube video and for further understanding we looked at data sets (Excel) and Twitter posts. Next, we got a brief introduction to Tropy and saw what we could do with it. Finally, we discussed and found out how classmates used data for their thesis. While we were learning all this, we also produced data ourselves while taking notes on Word, and we produced even more by completing this assignment. The course was very interesting. I know that data is very present in my life, but it is something that I am not actively aware of and therefore never really pay attention to. Data can make life and work easier, but I still need to learn how to deal with it. And I should no longer ignore it.


title: Session Summaries by Emilie-Furtado abstract: Summary-2 authors:

  • emifurtado date: 2024-10-11 —

Web Archives, 02-10-2024

During the class we were divided into groups. The task was to analyze non-profit web archives within about 30 minutes and present them to the class. My group was given the topic “Fluidity of the web” and we had three questions to work on, so we split up the questions and worked indidivelately at first, but we helped each other when we found information that matched the question of the respective assigned person. To answer my question, I looked at the profiles that edited the Wikipedia page, I found two Bots. Bots add information or links to open repositories and fix common errors, but bots cannot recognize when databases contain errors or incomplete information. So they rely on the help of humans. But humans can be biased. I was able to find personal information about the user, such as their religion, origin or job, but some profiles did not contain personal information.  From the first presentation we mainly learned that it currently results in a lot of lawsuits from companies like book publishers, because these websites provide protected documents and therefore violate laws. In the following, we have learned about the role of the Bnl and Luxembourg Web Archive. The BnL collects, preserves and catalogs all publications related to the Grand Duchy and we learn that the internet/website are not available for eternity. Sources on the internet disappear or are no longer readable/up-to-date. We also see this later in the example of the website The September 11 Digital Archive. Then we learned about Micro-archiving: Family and personal archives in the digital age. Where the students find some information about themselves and family. At the last presentation we learned that many documents are not relatable because many were marked as unkown. We can’t trace it’s origin back properly.


title: Session Summaries by Emilie-Furtado abstract: Summary-3 authors:

  • emifurtado date: 2024-10-13 —

Impresso, 09-10-2024

This course was again more about experimentation and independent exploration. In the first part of the lesson we were introduced to the Impresso website. Before that we had to read an article about Impresso that gave background information and explanations to help us follow the course better and sign a contract. In the course itself we were again divided into groups after the presentation and given the task of working with the tools and trying them out to get more familiar with the website. In my group we explored Inspect & Compare. This took a while and required a little help to get familiar with it. The first time we got no result, we tried to get results by translating the word „NATO Doppelbeschuss“ decision into French. The second time we got a result. This time we used the words „NATO Doppelbeschuss“ and Helmut Schmidt. We were then given the task of transferring our results to PowerPoint. In the last part of the course we had to present our founding in a short presentations. The short presentation not only helped us find out what our classmates are interested in, but also made us aware of limitations and other opportunities and gaps. Impresso is an interesting and helpful tool that I can use as a student of the university of luxembourg both for my thesis and for other presentations at the university. At the end of the course we talked about GitHub, which helped me better understand why we should use GitHub.


title: Session Summaries by Emilie-Furtado abstract: Summary-4 authors:

  • emifurtado date: 2024-10-16 —

Map, 16-10-2024

We did an exercise where we drew a map of our campus and the results varied. There were different starting points (packaging, train station, university buildings etc.), which showed us that maps are subjective. I also found it interesting that some took a different perspective when drawing a map. Some drew the buildings from a two-point perspective, others from a bird’s eye view. After the presentation, I realized that I had misunderstood some things in the article and left out important details. For example, that a location is a specific place and space is a social construct. Both are two different things. Maps are sources. Much depends on who made the map (who commissioned it?) and who uses the map. Maps are not always accurate or/and have gaps. GIS (Geographic Information System) works in layers, which we can turn on and off.(can be used as a comparison tool) GIS is not equal maps. GIS also works without a map. We need geographical points for maps, scales and a compass direction. Data in a map can be represented in different ways. And boundaries change when we compare maps with each other. In the third part of the course we were divided into groups and had 1 hour to work on a story map. I was in the group with the Travel industry and i looked at the story map of the Olympic marathon. The group worked with bullet points, while our group wrote sentences. In their map you can see streets, houses and fields. Which makes sense in their map and would be rather superfluous in our map. Ours, although has a simple design, but it was still overwhelming due to the many symbols and text. Our map even included 2 more maps and a catalog and a diagram. Our map shows the Globus trading market, while the Olympia Group map is designed to show a specific location. Used for measuring and document a distance. (A staring point and final destination) They also had a chronology and multiple maps so you could compare them with each other, see an evolution. Both maps tell an individual story. The Olympia Group map uses pictures and text, and our map uses symbols, text and drawings and uses colors to provide more detail/a hierarchy.


title: Session Summaries by Emilie-Furtado abstract: Summary-5 authors:

  • emifurtado date: 2024-10-23 —

    Palladio, 23-10-2024

    For this lesson we looked at networks and Palladio. We had to read 2 articles, an introduction where the origins, the pioneers and what it is used for were explained. We also had to read a tutorial, which I followed while reading. The lesson itself revolved around the topic: Why are networks important and why do historians want to study them? We collected ideas on the topic of wedding planning. Through the simulation (with the table extension) that we created, we see that there are already existing relationships, but there are also people who do not know each other yet. We can group them by age, languages, interests and relationship status, to see which people can be divided into which table. Left is always a group of “odd one” (which is not easy to put in a group). There are different dynamics in different tables. In the sense of relationship, communication and dominance. Someone can control multiple conversations; who talks to whom and who limits themselves to a few people. When different groups meet, different information is exchanged, but this does not necessarily mean that this information reaches everyone in the group. In networks there are also brokers. They are powerful (they have control, can decide whether to pass on information or keep it to themselves.) Over time, structures and dynamics can change. New network can emerge, e.g. some find themselves in the wedding. (Lovers) Afterwards we had a PowerPoint presentation. Where we learned what a social network. A specific set of links. (we saw different types in class) We can use it to explain why people behave differently in different groups. (Ex: smoking behavior, voting behavior etc.) Our behavior can also destroy networks. Relationships are important and how you are connected to them. Networks are flexible. Cases can be represented visually and networks are not limited to people. Social network analysis is both theory and method. Today, everything can be defined as a network. (expl: Family network) From the historical perspective, network can guide us to the self-perception of historical actors, to the metaphor for a group, to the constructed scientific research object and to the theory/method driven research approach. Finally, we had the task of creating our own wedding list. My group created this on Excel and used the table in Palladio. Thus, we created our own fictional network.


title: Session Summaries by Emilie-Furtado abstract: Summary-6 authors:

  • emifurtado date: 2024-10-30 —

EP Archive, 30-10-2024

For this class, we received a presentation from Ludovic Delepine and Marco Amabilino from the European Parliament Archives. First, we were introduced to the team and then we watched a video that we had to see before the lesson. In the presentation, they explained the steps they had to take in developing the website. First, they wanted to make the documents managed by the Parliament more visual and accessible. Previously, the only way to access the documents was by email or in person. Technically speaking, they are a small team. They wanted to solve this problem digitally, so they needed a server and at the same time, the documents should be accessible from anywhere from now on. That’s how Archibot was born. Important figures were introduced such as Edgar F. Codd. In the beginning, documents were just pics because they were scanned. Now they need a solution to make the characters readable/recognizable. Another important figure who was introduced is Gerard Salton. Next, we learn that metadata is generated = deep learning. A model is trained to generate metadata. Next, they developed their own “ChatGPT” = “Ask the EP Archive”. The idea was to create a simple search file where you can ask questions and get an answer. It works in different languages and you get the answer in the language you ask in, even if the document is not available in the same language. ChatGPT is unreliable and sometimes gives made-up answers. They create a prompt that says it is unable to answer a question. They have also added to the system that it should not engage in discussions. The data is not shared, everything stays in their system and runs there. The system selects the 10 most important documents. If you want all the documents on a topic, you have to search yourself. We were also advised not to enter any personal data into the system. Questions were asked and answered in the discussion round. For example, we learned that translation is based on training. Some languages are better trained than others, so if they can’t translate something, it is automatically translated into English. We also learned that they use Amazon because of the contracts. We also talked about the main fears and prejudices. People are afraid of being replaced by machines. Humans and AI make mistakes. Both can support each other. Humans can spread information, just like AI (which can also spread wrong information), even professors have wrong answers. Nobody is perfect, even AI is not perfect. AI is not meant to be perfect or replace humans. They are a tool to promote and facilitate searching. They don’t want to replace the archive. 

——- title: Session Summaries by Emilie-Furtado abstract: Summary-7 authors:

  • emifurtado date: 2024-11-06 —

DH-Theory, 06-11-2024

I was not present for this class and my summary is based on the PowerPoint presentation, the reading, and help from classmates. From the text, we learn that data is real research. Each step of selection, enrichment and classification is based on interpretation of the data. These interactions change the data. The PowerPoint deals with fake news and how we can deal with and recognize it. David Irving selects and cherry-picks the information he wants to use and denies Hitler’s crimes. He bends historical evidence until it conforms to his ideological leanings and political agenda. Poor historical scholarship can be proven by written records. But for data-based research we need other methods. We look at the wedding guest list network from session 6 to write down our decisions for each of the 5 steps. We had a pretty small marriage list, they are pretty much connected to the workplace. They are from the 21st century as we are more open to different genders (non-binary, for example) and relationship types (homosexuality), which are represented in the Excel file.  We focused on people who like and dislike each other in a workplace. Sources that can be used for our data are social media (follower list), (witness) statements and maybe text messages. The nodes can base on who is sitting at which table, with the edges defined by the same interests. We lack consistent data for example, standardization of name spellings and linking. Classification could be based on friends and enemies at workplace. Finally, we see the FAIR principle; the data does not disappear when the projects end. It still exists and was created by someone using an app, e.g. Excelsheet, it has a copyright, it can be linked and connected to other datasets and data sources and is it possible to connect our network to other weddings, how well?