Naturally, I'm now going to critique all of these definitions. 🙂
@Mariano Guerra "Data are raw facts. When analyzed they become information."
What is the difference between a "fact" and "information"? If something needs to be analyzed to be considered information, does that mean an encyclopedia is not information until a human or machine reads it? What does it mean to "analyze"? If the encyclopedia emits photons, and those photons interact with other matter in the world, then has the encyclopedia been analyzed? If so, then the encyclopedia is in a constant state of being analyzed, and so is every piece of data. To exist is to be analyzed. Thus the distinction evaporates. (See also: Appendix A below).
@Konrad Hinsen "Data is any symbolic expression using a well-defined notation. Information is data presented in a context that allows its interpretation."
Can data exist outside of the physical universe? If not, then every piece of data is continually surrounded by a context — its physical environment.
Moreover, your use of the term "interpretation" is synonymous with Mariano's "analyze", and so I will again assert that data is constantly being interpreted by its environment — even if unintentionally.
@Jan Ruzicka "The term 'data' communicates that any internal structure and relations to external objects are irrelevant."
I can accept this definition. But is it useful in reality? In what situations is it important to stress that you don't care about the structure and relationships that a piece of information expresses? In what situations would using the word "information" mislead a reader where your definition of "data" would not?
@daltonb "I would define information operationally as data that leads an agent to update its internal model of its environment."
This seems very close to the earlier definitions that "information is data that has been analyzed or interpreted". Thus, I shall refer you to my responses to those, and also to Appendix A. (Also: what conditions are necessary for something to be considered an "agent"? Can an arbitrary collection of atoms be considered an agent?)
@Andrew F "I contend that just because a term is tricky to define doesn't mean it's useless. In what actual situation does the common understanding of "data" fail?"
In general, I agree. However, there are hundreds (thousands?) of research papers, books, and blog posts which laboriously try to articulate a difference between "data" and "information", not to mention companies that are trying to sell products and services related to "information systems". The amount of human effort spent on understanding the (imagined) distinction is considerable. That's the "harm" I'm talking about.
@abeyer "I'm curious to hear what you think is an equivalent term for 'data' that is more broadly understood... I'm not coming up with one."
Information 😇 (or "formation": see Appendix A).
Appendix A:
Some of the responses in this thread come close to describing information as a process, rather than a thing. One might suggest that "information" is the process of analyzing something, or the process of updating an agent's internal model. In simpler words: information is the process of informing. If we accept this definition, then we still need to describe what things (objects) are involved in an occurrence of informing. An obvious term that could help us is "formation" (or "form"). We could say: to inform is to create a formation (in someone's mind, in a computer, or more generally, at any location in the universe). "In-formation" is then the process of creating such a formation. But what induces a formation to be created? Well... interactions between formations of course! (This matches up with physics: the configuration of the universe is fully determined by interactions between matter and/or energy.)
Given these definitions, the term "data" would be a synonym for "formation". I could accept that.