Name:
AI vs IA: How Will Humans and Technologies Interplay in the Future of Scholarly Communication?
Description:
AI vs IA: How Will Humans and Technologies Interplay in the Future of Scholarly Communication?
Thumbnail URL:
https://cadmoremediastorage.blob.core.windows.net/b7afb597-e9b1-4e31-a81e-9159c2bfee25/thumbnails/b7afb597-e9b1-4e31-a81e-9159c2bfee25.jpg
Duration:
T00H51M59S
Embed URL:
https://stream.cadmore.media/player/b7afb597-e9b1-4e31-a81e-9159c2bfee25
Content URL:
https://cadmoreoriginalmedia.blob.core.windows.net/b7afb597-e9b1-4e31-a81e-9159c2bfee25/3 - AI versus IA.mp4?sv=2019-02-02&sr=c&sig=58Is7rHPJM%2BHu%2BMYtIiKytHyhWTjer%2FM0NhZ3MD0ysA%3D&st=2025-01-22T13%3A07%3A51Z&se=2025-01-22T15%3A12%3A51Z&sp=r
Upload Date:
2020-11-18T00:00:00.0000000
Transcript:
Language: EN.
Segment:0 .
DANIEL HOOK: It's a great pleasure to be here. This topic is a little offbeat for me. While we do quite a lot of AI-type work at Digital Science, in fact, I am sadly under-qualified to talk on this area. And so, this is going to be something of an interesting journey for me, as well. It is, however, something that I have been quite interested in in the last few years. And so, I'm quite interested in how we think of the space in which we all work, in terms of AI, and in terms of IA.
DANIEL HOOK: And by IA here, I'm really thinking about Intelligent Augmentation. So, the IA-- AI, I mean to start, most of us in this room will have looked at companies who we might want to invest in or we might want to work with. And I think the enduring thing that all of us know about the .AI at the end of the URL of the company, basically adds 10 million to the valuation.
DANIEL HOOK: This is not the type of AI I really want to think about today. I want to think slightly more broadly. And I figure that actually, I really don't know the level of people in the room. Who has any kind of background in AI? A little bit. A few people have some background in AI.
DANIEL HOOK: Excellent. So I can say lots of things and get away with them. Tyler, I've got my eye on you. Good. So I should be able to get away with some things, but I thought actually it might be quite interesting for some people who have a little bit of a background in the history of AI because I think it is a fascinating field and something that is now starting to impinge on the world in which we live.
DANIEL HOOK: I think the most important event in AI that we have seen in recent years is actually Go-- to do with Go. The game of Go is a complicated game. In some sense, the rules are simple, but in fact, how you handle this game in computer terms-- so that you can get a computer to play you and even beat you-- has been really challenging. In fact, significantly more challenging than chess.
DANIEL HOOK: We saw the first chess computers probably in the '50s and '60s. We saw them become mainstream in the '70s and '80s. And by 1997, Garry Kasparov was beaten by a chess computer. However, really interestingly, Garry Kasparov was not beaten comprehensively by a chess computer. The set of games that they played, the chess computer won 3 and 1/2 games to 2 and 1/2 games by Kasparov. So, the computer, which was Deep Blue from IBM, only won by a single game.
DANIEL HOOK: And in some sense, if you know something about that background, it was, in fact, just by brute force. Arguably, the IBM team programmed the computer not to play chess, not to beat a chess player, but to beat Kasparov, specifically. They studied how Kasparov played and they designed an engine that would beat Kasparov. And this is not what people are trying to do with AI. The Holy Grail of AI is generalized artificial intelligence, this idea that you can get a machine that is going to be able to interact with a human and to be as intelligent as a human in its perception and its interaction, its ability to integrate with society, in some sense, its ability to be able to perceive and understand what we understand and perceive, and potentially to do that better than we can.
DANIEL HOOK: That's a pretty high bar. But move forward a few years, and in 2016, a monumental event happens. Lee Sedol, the world champion of Go, was beaten by the first Go computer, AlphaGo. The DeepMind company, who is affiliated with Google, came up with a computer that was actually able not to simply beat Lee Sedol, but to actually beat Go players in general.
DANIEL HOOK: And this beating wasn't by a single game. It was a five game match, and the computer won four out of the five games. So it was a comprehensive beating. It was a significant step forward, something that we haven't seen before. But this is actually where the story of AI kind of takes off. AI, as we think of it these days, people will know terms, perhaps, like machine learning and things like this.
DANIEL HOOK: These ideas have been around for 30 or 40 years. But what happens in 2016 is that the algorithms and the computer power and the amount of data that we had caught up with the promise of the algorithms that we've had for 40 years. And that's really the sea change. And much as in the same way that the Space Race started in the 1960s and the late 1950s, when the Russians launched their orbiting spacecraft around the world.
DANIEL HOOK: And you see things like Sputnik and Gagarin going up. This caused the Americans to be concerned and to be worried about how the world was developing and it signaled the beginning of the Space Race. With the advent of AlphaGo, the Chinese had a similar fire lit in their bellies. They were worried about how computing would move forward, and this AlphaGo game beating their esteemed champion-- and Go is something that isn't just a game in China, it's very much part of the social fabric-- was something for them that they really needed to be worried about.
DANIEL HOOK: And so if you look at Chinese investment into research in AI, you see a complete sea change in 2016. The amount of research funding that is now going into AI and into innovation around the translation of AI is actually scary. We've done some analysis with our data at Digital Science, and I can tell you that if you look at the top 10 universities in the world doing work on AI, nine of them are in China.
DANIEL HOOK: One of them is in the US. MIT, on the scale of these universities in China, looks like a small island in comparison to continents, in terms of the amount of work and the amount of funding that's going on. The collaborative landscape is shifting towards China in a massive way around this particular research area. So, this is really fruit of the-- I would argue fruits of what happened in 2016.
DANIEL HOOK: The original work by Jeffrey Hinton has been around for a while. I think that when Hinton wrote this paper, he could not have guessed 30 years ago that it would now be sitting there with 8,000 citations. It would be probably the seminal work that actually is powering the deep learning revolution that we're seeing going forward now. And effectively, data has become the new oil of our age.
DANIEL HOOK: I think everybody knows that at some level, but there are different levels of interfacing with that comment. I think we all know that in running our businesses, we want to be more data-supported. We want to think more about the data that's actually helping us to move forward, and we try to use those data to create metrics and create insights. But with the data that's available in AI-- the data that AI needs to create its ecosystem and to create these deep learning environments-- we're seeing an exponential speedup in what data means to this space.
DANIEL HOOK: And, in fact, China has a very interesting position in this because China has so much data through things like WeChat and through their various ecosystems where researchers can get access to significant amounts of data to work with. It provides them a significant commercial advantage. They have 1.3 billion people in the country, all of whom are throwing off data. All of that data is effectively and potentially available to researchers inside China, but not outside China.
DANIEL HOOK: So, this is a significant competitive advantage and one that we need to consider carefully when we're going forwards. Open data, I've always been a big fan of open data and open science, but of course, one has to be quite careful about this. I remember running a user forum back when I was CEO of Symplectic-- this is a decade ago. And I took the room and I did one of these things where you put a statement up on the board and you say to everybody in the room, please go to this side of the room.
DANIEL HOOK: If you agree with the statement, go to this side of the room. If you disagree with the statement-- and anywhere in between is fine. And then I get someone with a microphone to go and attack people and get comments from them. And someone went, and the statement that I put on the board was "my institution believes firmly that open data is for the good of the ecosystem, and that we should make our data open." And one person went to one end of the room, which was the "yes, we should definitely share our data" end of the room and then quickly moved to the completely other end of the room and consequently marked himself up for attack.
DANIEL HOOK: So I gave the microphone to my assistant. She went over and talked to him and said, why did you move? And he said, well, I thought about it. And my institution firmly believes that every other institution should make their data available. [LAUGHTER] So-- and I still think that many institutions are there. And in some sense, although I think open data and open science is a great thing, it is something that we need to try and do together.
DANIEL HOOK: It's something where I think we're still building trust. And I think in how China is moving forward, this is something where they're going to continue to need to build trust because they're going to need to make their data open at some level to make sure that it's an equal partnership. So, if you think about data as a new system of or a new raw material that we're working with, then it's quite interesting to look at the economics of industrial revolutions.
DANIEL HOOK: And if you look at industrial revolutions from the past, you can see that there's a time difference between when the initial work of the Industrial Revolution takes place-- the initial things that trigger the revolution-- and then many years after you see the effects. Now, just because we are used to things happening quickly, it does not mean that they actually happen quickly. Most of what we experience today in the world is actually fruit of what was happening in the late 1960s.
DANIEL HOOK: In the late 1960s, the US was spending around 25% of its national disposable income at governmental level on research. That's nowhere near the percentage that you guys are spending today. As a result of spending that money in the research into research infrastructure, we gain things like the internet, the world wide web, microcomputer revolution.
DANIEL HOOK: These things took 20 years to come about, and we're still reaping the benefits today. So, in thinking about artificial intelligence, although we're seeing a frightening pace of things moving forward, it is important to realize that the actual effects of what we're seeing today are probably 20, 30 years off. So the world that we're building today, the conceptualization that we're putting together, is something that actually will pay back over a period of time.
DANIEL HOOK: The other interesting thing about looking at things like this is that, in fact, it is fascinating that scientists and researchers are the ones that come up with these revolutions quite often, but we're remarkably impervious to accepting them ourselves. A lot of researchers are now kind of having the internet done to them. They invented the internet originally, but they didn't really think about what it was at the time.
DANIEL HOOK: It was an intellectual exercise. They moved on from that. They're doing other things. But, in fact, the professionalization of research, how institutions are now moving forward, the types of thoughts that people are having around how their research is managed, and the technologies that we have to help manage and publish and integrate with research are all things that are now being done to us.
DANIEL HOOK: They're not things where we're holding the reins and moving forward and moving ahead of the curve. The fact that the paper hasn't really changed in 350 years shows you how-- not actually the people in this room, but the people we serve-- are so ensconced in the idea of the printed thing that they hold. Tangibility and physicality are tremendously powerful physical relationships that we have with papers.
DANIEL HOOK: And actually getting over those relationships are really challenging if you're a technologist and you want to move things forward. So the way I think about where we are right now is that we are effectively tool providers. We are trying to get people to move from this state-- where they are gradually using our tools and hopefully, augmenting their intelligence by tool usage-- to the stage where we actually move into the next phase, and we start seeing tools move in another direction.
DANIEL HOOK: The question is, when will we get to this, and is this where we want to be? I would suggest not necessarily. But it is important to remember that we have choices in how this works. A lot of people feel, as I was saying, the internet is done to them. Internet technology is done to them. But actually, I think in our community, almost exclusively, we have the ability to have an intelligent conversation with our stakeholders and work out where the best place for us to end up is.
DANIEL HOOK: This is not a globular cluster or a galactic simulation, this is actually a data dump out of Dimensions where we have looked at AI research. And we have taken all of the AI research in the world, around a million papers and their connections, and we have produced this on a very large scale here. And we've colored it associated with different research areas that happen inside AI. AI is many things to many people.
DANIEL HOOK: And we can break that down a little bit. So, we've done some topic modeling to try to understand what's going on in different fields. And we've come up with 15 high-level fields and we've plotted the activity of research for each country in the world. And you can see quite interesting things come out even at this level. You notice that China is perhaps a little more interested in facial recognition than the rest of us.
DANIEL HOOK: That's a thing to take home and think about. It's actually interesting that lots of researchers around the world are interested in facial recognition, not just the Chinese. They just seem to be the leading ones. If you look at these footprints that I've printed-- if you saw them up close, you would see that, actually, the scale differs on each of the axis for each of the countries.
DANIEL HOOK: So even though they're broadly similarly shaped, the scale of the research in each case is different. And what you would notice if I plotted this on a rank basis is that in 13 out of 15 areas, China is the largest producer of AI research in any of those fields. So, China is definitely where the locus of information and the work is. The US is number two.
DANIEL HOOK: But we also see other countries-- if you take the EU as a whole, it's certainly up there with them with America as a significant power in this space. But no other single country has the kind of capability and research that the US and China do in this space right now. So how is this actually going to disrupt our space?
DANIEL HOOK: There's a great report from PWC that actually starts looking, in global terms, about how things move forward, and I really recommend this to you because it's a wonderful summary of the whole area in one go, and it's very applicable. So they actually talk about AI happening to us in three waves. They talk about the algorithmic wave, they talk about the augmentation wave, and they talk about the autonomy wave.
DANIEL HOOK: Now, broadly speaking, algorithms-- the algorithmic wave is the thing that's kind of here right now. It's hitting us right now. This is where we're seeing recommender algorithms do things for us, which I think are tremendously dangerous, and maybe we'll have time to come back to that a little bit later. Augmentation is where you're starting to see artificial intelligence tools placed in people's hands.
DANIEL HOOK: And then autonomy is where you see robots and robotics starting to take more of a hold. And I think all of them have some applicability in our space, but in fact, some more than others. And I think-- I'll tell you the punchline first. The punchline I think, is, for us, that augmentation is probably where we will sit for a long time in this space.
DANIEL HOOK: I think that autonomy is not so relevant to where most of us are. I think robots have limited purchase in most of what we're doing because we're in a more creative industry. But I think augmentation is something that can really bring significant gains to all of us, especially with cost pressures in this space which we all inhabit.
DANIEL HOOK: We need to become more efficient. We need to broaden and diversify the things that we bring to people, and a lot of that can be done through augmentation. If you look at where PWC come out on some of these things, you can see, in fact, that Wave 1-- the early 2020s-- the algorithmic revolution doesn't actually lead to too many job losses. This is really tiny-level effects.
DANIEL HOOK: This is a few percent. Wave 2, in the late 2020s-- this is when we start getting augmentation in place. So this is for the workforce, overall. They think there's going to be a much more significant hit, and you could see 15% to 20% of jobs disappearing as we know them currently. And in Wave 3, in the mid 2030s, you can see now this kind of long time scale that I was talking about coming into play.
DANIEL HOOK: You can see much higher disruption in the job market. And the question is, does that actually apply to us in our area? And I would say in a limited way it does, but I don't think that we're going to see anywhere near the Wave 3 kind of disruption that you'll see in the overall job market. If you look at particular industries, you can see that there are different profiles, and I would argue that, in fact, we're much closer to financial services than we are transport.
DANIEL HOOK: If you think about the types of thing that we're involved in in our space-- lots of data, lots of creativity, lots of analytics, lots of things that are tricky for other people to intuit, or indeed, that you could potentially make algorithmic, but you do certainly want a person looking at them before they actually-- before they actually get committed anywhere-- then you can see that I think that the yellow line here is the one that's more appropriate to us.
DANIEL HOOK: So I don't think it's the end of the world yet. I always preface with-- but you can also see here, if you look at it on a slightly different axis, manual tasks, you're going to see things going down significantly. So anybody who wants to do a postdoc is going to have a very different relationship with their work in various subject areas in the world to come.
DANIEL HOOK: Routine tasks, again, waiting the postdoc, here mostly, or the PhD student. And various things here in management-- academics don't really do management, if you might have noticed. It's something that you have to grow into. You don't get trained for it. So there are all sorts of different aspects to this, which actually are quite marginal in the type of place that we're working.
DANIEL HOOK: And so I show these really to show you the kind of places where most people are doing analysis and where most people are thinking about things going-- actually, places which don't hit us centrally as a market to work in. The one final piece of this that I thought was completely fascinating was which countries are going to be hit most badly, given the analysis that PWC did.
DANIEL HOOK: And you can see, in fact, that countries like the US are actually fairly close to the top of the table. The UK is fairly middling. We're seeing countries like New Zealand, Japan, and Russia being the countries which most benefit from the AI revolution. So you might consider that over the next few years, in the way that we're seeing research move towards China, as I've indicated before, we might also see other parts of what we do shifting to other locuses in the world, simply because they're better set up to take advantage of the tools that we in this room are going to be building.
DANIEL HOOK: So, our standard markets may not be our standard markets unless they can retool and move themselves quite quickly. Given that we're in New York, I thought it would be good to call out this research institute. This is at NYU, the AI Now Institute. Again, I strongly recommend reading their material and looking at what they are doing. I think one of the really fascinating things when you start to look at artificial intelligence is this real concern that as we get towards machine learning and deep learning, we start to lose our understanding of what's happening inside the box.
DANIEL HOOK: I remember when I was at school, I had to-- I was that age group where I had to learn to both use a calculator and do the calculation in my head. And if I couldn't do the calculation in my head then I wasn't allowed to use the calculator. And so, it was only by understanding how the calculation worked that one got to play with the calculator.
DANIEL HOOK: And I think we as a society are a little bit in this place right now with AI. When you think about deep learning, and you start to understand deep learning technologies, the very structure of the deep learning ecosystem seems to be that there is a black box into which one cannot look. So you put in the data, the machine then self-teaches itself from the data to understand certain things, and then creates you a mechanism by which you can put in an input and get an output.
DANIEL HOOK: But that black box that happens in the middle is something that we lack the technical capability right now to understand. It's something that's a closed world to us. And that has led to some really interesting developments. AI Now wrote this report, within the last 12 months, called "Discriminating Systems: Gender, Race and Power in AI." And this is a fantastic report. It was referred to in "The Guardian" as a result of some of the strikes and some of the people at Google going out on strike because of some of the work that Google was doing.
DANIEL HOOK: I think the most interesting of these cases is actually that Google were trying to create an AI to help them with their hiring practices. And so they put all of their hiring data into an artificial intelligence, and the artificial intelligence only would allow them to hire white males, which arguably isn't terribly PC or really great for the business. And so, Google obviously realized that this was a big issue.
DANIEL HOOK: The outing of their staff in other ways was actually to do with other projects that they're working on. But because you have this black box that's going on, there's a lot that's hidden including the fact that your AI is making ethical judgments on your behalf, which are now completely hidden and which you're now completely divorced from. And I would argue that in the work that we do in research and supporting research, this is actually the biggest danger of what we do.
DANIEL HOOK: Already, this type of technology is shown to give poor results when looking at hiring. It's giving biased results with facial recognition. It already shows that there are errors or biases in financial algorithms that the financial sector use, and that they use to manage our money and play with our money. So these are already things that are highly concerning because they're starting to affect the society in which we live.
DANIEL HOOK: Who knows? There may be issues in voter systems that are based on this type of work as well. So, it's actually quite heartening to see that there are now articles-- this is a research scientist called Been Kim, who is a researcher at Google. And she is starting to work on algorithms at Google Brain which allow us to take that black box apart and to try and understand that actually even if data is pulled into the system and tells us that only white males are doctors-- which is exactly what this algorithm did-- we can actually start challenging it and taking it apart and holding it to some kind of ethical standard.
DANIEL HOOK: And the technical term for this is TCAV, is Testing with Concept Activation Vectors. So this is a technology that I think you would start seeing to be much more pervasive in the things that we're doing, so that we start having checks and balances on this black box that's being introduced. If we think of our own systems that we're using in our space, in some sense, if you trace back to way back when, in some sense impact factor is an algorithm for us to shut off our brain and trust something that we don't completely understand that's a bit of a black box.
DANIEL HOOK: If we move forward, and we start thinking about the revolution that happened in the '90s and '00s, recommender systems are another thing where we've been taught to shut off our brain and take the thing the computer gives us. Google, as a search mechanism, is crafting all of its search results for each one of you individually. It knows who you are. You log into it.
DANIEL HOOK: Even if you don't log into it, it knows what your IP address is. It will craft its search results to favor the world that you want to see. And this is where we see social bubbles coming out of. And so these are all AI technologies that are unwittingly affecting our everyday lives right now. Imagine a world in which you have a research assessment exercise and your government decides to rank your universities and give them money based on how good they are.
DANIEL HOOK: Then you have a world-- which I think of as a kind of a high evaluation, high-touch world-- which is now using metrics to decide how to rate your institutions. But those metrics are intrinsically based on a feedback loop, which is meaning that you tend to fund popular research rather than good research. Because the argument being, you have so much research now being produced-- I think in Dimensions this year, we'll have something like 5 million publications-- 5 million things with a DOI that pertain to research.
DANIEL HOOK: Those 5 million things, they're far too much to read, so we won't read them. We'll let the computer read them for us and we'll then make an informed decision based on what the computer says. But, of course, we've taught the computer that a good thing is a popular thing, and so you get a feedback mechanism where you only then fund popular things.
DANIEL HOOK: And that's not just in a system, it's a weaker effect in a system where you don't have a national exercise, but still funders like NIH, like NSF, will still be looking at things in metrics-driven ways and they will be pushing their results towards a popular outcome, whether they realize it or not. And this is why I talk about data-supported decisions, rather than data-driven decisions.
DANIEL HOOK: The idea that data should drive the decision, I think should be an abhorrent one to all of us in this room. We want to make our own decisions, which means we need to critically look at the data and we also need to understand what's driving the outcomes of those data. So the types of places that I think we're starting to see AIs come up rather than just in the metric space in research-- I just wanted to touch on a few things that either we're doing at Digital Science or that I've seen happening in the space, which I think are quite interesting.
DANIEL HOOK: So in actually carrying out research, I have seen several technologies in the last 12 months where people in labs have a personal assistant, like a smart speaker, and they can give the smart speaker instructions to start a timer, or to tell them what the next point in the protocol is that they're executing, or to track something which might give us more insight into reproducibility of an experiment.
DANIEL HOOK: So this could be a very positive technology, but it's certainly one where having a blind thing that just completely believes the researcher and what the research is doing could be something where there are ethical things that are coming into play where we don't completely see what's going on. And if we trust it too much, we think because it's on the scene, because it's happening in real time, it must be true.
DANIEL HOOK: And actually, there are a lot of things to worry about with this. There's a company that we made a small investment into a few years ago called Tetrascience. These are some of Digital Science's sexier properties, if you will. Tetrascience is a company that is Internet of Things for the lab. So it is trying to make your lab equipment more internet aware, more aware of their surroundings.
DANIEL HOOK: So with this type of technology, you could imagine coupling that technology that you have with your lab assistant telling you which starter you're typing, starting now, and which protocol you're executing, combined with a data stream from Tetrascience telling you about the things that are actually happening in the lab and what switching on or what's switching off and what's at what temperature, where you actually have a more complete view of what's happening in the universe of that lab at any moment in time.
DANIEL HOOK: Transcriptic is probably our company with the most buzzwords. It is a robotic lab in the cloud using machine learning to optimize experimental outcomes. What Transcriptic is, it's kind of like AWS for a lab, so you can actually take your experiment, you can send all your materials to this place in Menlo Park, they will load it up in what looks like a shipping container with a robotic arm and equipment, and they will actually run your experiment for you inside this confined environment.
DANIEL HOOK: And you can actually automate the experiments and get them to do a complete scan of the parameter space with an AI that then zooms in on the most promising potential features of your experiment. So these are all technologies which move us significantly further forward in terms of getting rid of postdocs. Another postdoc-killing technology is-- going back to HAL again, is 1715labs.
DANIEL HOOK: 1715labs comes out of-- we don't have an investment in this, by the way, just to be clear-- 1715labs is out of University of Oxford and the Zooniverse project, so Galaxy Zoo, and it's extremely good at marking up and using machine learning to enhance the speed and quality with which you can tag data sets and give them different properties and track them for machine learning purposes.
DANIEL HOOK: So all of these you can think of in different parts of the experimental cycle that could really, again, stop us needing postdocs. Writefull is something that we're very proud to be invested in. We invested in this company when it is just two guys who were looking at machine learning approaches to helping us write better papers. And so, they actually have a complete machine learning model based on academic literature so that when you write in the Writefull environment, it will not only tell you when your grammar is wrong and when your spelling is wrong, it will also tell you when you've used the wrong adjective and what a more natural English adjective would be to make your writing more palatable to the reader.
DANIEL HOOK: But not only that, because it's trained on an academic corpus, it understands what should be written academically. So, in fact, it can bring down the quality of your writing to the level of other academics or it can raise your quality, depending on whether you understand the research area in which you're working. There are lots of examples of this, you know, in theoretical physics. There's a thing called measurement theory-- that's a really bad term to have to work with if you work with Grammarly or any kind of engine.
DANIEL HOOK: They'll say, what are you talking about, measurement theory? This makes no sense as words that go together. Again, because Writefull has scanned that literature. It understands that measurements theory is a thing that exists and that you can use to improve your writing. Ripeta is something that's quite interesting right now because we have a lot of discussions about peer review.
DANIEL HOOK: When I was doing my research, I was in a wonderful position where if I wanted to look at the reproducibility of a paper because I was a theoretician, I would simply reproduce the paper. I would sit down and redo the calculation. And that's how we review papers in theoretical physics, if we're being good. We actually sit down and we go through each line of work and say, does this make sense?
DANIEL HOOK: Can we follow the logical argument? Now, obviously, in a lot of disciplines, that's just completely impractical and you can no longer do that. And so, Ripeta is an AI-based technology that is trying to pick up on the hallmarks of reproducibility. If you mentioned a data set, have you made that data set available? If you mentioned an analysis approach, have you been specific about the code that you used?
DANIEL HOOK: Have you made the code openly available? Did you talk about the right version of the code? So, this in broad sense, is a technology which could automate certain parts of the peer review process. And I'm not going to say anything about postdocs at this point. Clearly postdocs aren't the only peer reviewers. But I'll let you leave you to complete that for yourselves.
DANIEL HOOK: One of the things that we've been doing, again, at big scale effort at Digital Science is search and classification. A lot of the AI work we do is in trying to automatically classify papers into specific research areas. And this is a devil of a task. This is really complicated stuff to do well and to do consistently. But it's something where we've had some success, and we think we have fairly good algorithms now.
DANIEL HOOK: But, again, in this world, you're very focused on delivering a product to market, and you're less focused, quite often, on the issues of bias in AI. And so, one of the things that we always need to think about when we're producing systems that classify content or that recommend content is, what's the real bias going on there? Similarly, with Ripeta, when Ripeta's putting its algorithms together, what is it actually biasing because we don't necessarily know what's going on in that black box?
DANIEL HOOK: Is it biased towards papers that are written in a certain style? Is it biased more towards male style of writing to female style of writing? Is it biased towards people who formulate their topic sentences in a particular way? If you put your topic sentences at the end of your paragraph, as some people do, will it give you a lower score or bias the result in some way that suggests your work is less reproducible because it hasn't picked up on how you should have structured your data?
DANIEL HOOK: So these are all the kind of problems that we face in doing what we do. But there are-- you know, things continue to move on, and one of the things that I've been saying for some time is that I believe that we are only so far from books and papers being written themselves by computer. And I think that in future, we may well spend less time writing and editing books and papers, and in fact that there are systems that will write these for us.
DANIEL HOOK: And this is really the first case in point. Beta Writer was a collaboration with the University of Heidelberg and Springer Nature and they've created a book called Lithium-Ion Batteries. It's a fairly dry read, but it is completely computer-generated, and it was sent off to external reviewers and the external reviewers came back and said, it's a little bit of a dry read. But they said it was accurate and it passed peer review independently without them knowing that it was computer generated.
DANIEL HOOK: So this is a [INAUDIBLE] of work or review of work is already a technology that we possess. So that's actually quite scary if you think from an academic perspective. Part of my job, which I spent a lot of time doing, is thinking about how to present a set of results, how to make sure that it's communicated well, how people are going to consume that. And we're starting to see technologies emerge now which actually take that away from me.
DANIEL HOOK: So what is my job as a researcher? Where does the creativity part of my job lie? Is it in thinking up the experiment? Is it in doing the experiment? Is it in the concept behind the experiment and working out what we should test? Or is it, more generally, in bringing the right groups of people together so that we're doing the right experiments overall?
DANIEL HOOK: These are really challenging questions, I think, and certainly ones that go beyond the remit of this audience, overall. But they are touch points in which you will all have insights, and you will all have clients and people you work with who are going to start being worried by these questions over the next few years. Creativity, is for me, the one thing that I think unlikely that will be reproduced or superseded by an AI.
DANIEL HOOK: I think there's something about that fundamental leap of logic that one has to make in order to get to a scientific discovery or research discovery of any nature. Where, actually, the computer can't do that bit. And that was very definitely my thinking. I've spoken to a number of people this year who are looking at trying to do large-scale scans of the research space to try to understand where there are missing pieces, where we could potentially work on particular pieces of the puzzle, where it's kind of, there looks like there's a gap-- shouldn't we do something here?
DANIEL HOOK: So very much going back to the original paper in "Nature" on the strength of weak ties. So there are people doing that kind of work, but I still think that's merely a scanning mechanism that helps us identify things that we should be thinking about. And it still doesn't provide that creative leap to the people like Einstein made when they came up with general relativity. That's not something I believe that could have been thought of by a computer.
DANIEL HOOK: And then I encountered the company Iprova, which was interesting to me because I was thinking, what is the limit of this kind of creative jump. And Iprova's is a great one because they are a company based in Switzerland who help you load in lots of information about a particular space that you're interested in and they add in scientific papers and things that are in the regulatory media.
DANIEL HOOK: And they then try and come up with spaces that you should look at to try and innovate. And they are quite successful. They have these wonderful ideas. Like, for example, they happen to know that your mobile phone has an induction coil in it, and that increasingly people are starting to charge their mobile phones by placing them on an induction coil.
DANIEL HOOK: Now, it turns out that your mobile phone is getting thinner and thinner and consequently harder and harder to pick up. One of the things that this phone also has in it is a motion sensor, so it knows when I'm close to it. So they've invented a technology completely from a set of suggestions that came from a computer and then were taken forward by researchers whereby they can actually reverse the polarity in the induction coil as your hand gets close to the phone and the phone jumps that you can hold it in your hand.
DANIEL HOOK: And I think that's just genius. [LAUGHTER] So, I'm really challenged about actually whether there isn't some level of creativity that a computer can come up with on its own at the moment it's still partnering with humans. But to come up with suggestions like that, actually, that's not an obvious thought, right? And that's actually quite an interesting move forward.
DANIEL HOOK: So, at one level, I could be saying to you we're going into an age where intelligent augmentation is going to help us do significantly more. At another level I could be saying to you no one's ever going to be a postdoc again, consequently we're not going to get them to be research scientists because nobody ever makes postdoc.
DANIEL HOOK: But I think the reality is probably a little bit more nuanced. And so, my parting thought to you is, really that if you think about the different areas in which we all work across this space, and you look at the algorithm, the Wave 1, the augmentation, Wave 2, and automation, Wave 3, there are areas where I can already kind of put in tools that I can see meeting that need or going in that direction.
DANIEL HOOK: There are further areas which I've labeled in yellow, where I can see a tool and I can see that potentially that's going to change the job market because of the nature of the tool. There are areas that I've placed in red where I don't see a tool, but I think actually it would be really dangerous for us to go there. And maybe we collectively, as an industry, make a choice that's not somewhere we're willing to go because we think it's dangerous.
DANIEL HOOK: We think, actually, this takes us to a point where we have ethical issues or where, you know, it's unacceptable that we should be going into that space for a variety of different reasons. And then I've put white spaces in, which are essentially places where I don't think we can play. I think there's a technological limit. So this is a very high level mnemonic for some of the things that I'm kind of musing on recently.
DANIEL HOOK: What I do think is that we will see further effects, though. I think we will see a lot of things where jobs are changing. So, if you talked to people 100 years ago and you talked about a computer programmer, they would not know what that was. It would not make any sense to them. And in 50 years time, given that things are moving faster, there will be job titles which we have no conception of today, where people will have moved into those different spaces.
DANIEL HOOK: Right now, "blogger" is a job title which, you know, probably didn't exist 20 years ago, and I think which probably won't exist in another 20 years. I hope nobody here is a significant blogger. However, I do think that there are fashions in how job titles will move and how our job market will move, and I think everybody in this room is responsible for, has accountability for, and has an ability to change that landscape in a positive way for our space going forwards.
DANIEL HOOK: So, I'd invite you to think what the really game-changing tools are that are coming out right now and that you can imagine coming out in the future, which are intelligent augmentation devices for us, rather than either replacements for us or things that we should be worried about. And so that's where I'll finish. Thank you. [APPLAUSE]
HOST: Thank you very much, Daniel. That was an excellent talk. We have time for one question and-- [LAUGHTER] --sorry. No, that's quite OK. And we do have a break later, and I'm sure we can ask you questions then. But Mr. Harrington, my colleague Sarah has the microphone.
ROBERT HARINGTON: Yes, hi. Is this working? I'm Robert Harington, and I work at a math organization, the American Mathematical Society, and I thought it was fascinating what you were saying about where we're going on the job [INAUDIBLE] management of useful postdocs. But mathematics is very different to, say, synthetic biology.
DANIEL HOOK: Yes.
ROBERT HARINGTON: And so maths, for example, has theoretical physics as--
DANIEL HOOK: We're very similar.
ROBERT HARINGTON: [INAUDIBLE] sport and have some uses products, published, perhaps, in a different way. Do you see-- what's your view, maybe, coming from your point of view as a theoretical physicist to on how different disciplines may culturally evolve-- in terms of how, you know, who is going to go in and do that work and will people become-- will students become more likely to go into the humanities and math and [INAUDIBLE] disciplines?
ROBERT HARINGTON: What's your view on the evolution of disciplines?
DANIEL HOOK: Oh, goodness. This is a tricky one because there are all sorts of, kind of, forces at work here. So, I would say right now, if we look at the economics of the situation, there is less and less funding for arts, humanities, social sciences. And I think that's a big concern for everybody, or at least it should be because I think to have a healthy research environment we need those topics and they should be being significantly funded, not being tapered off.
DANIEL HOOK: So, you know, even as a kind of a hard scientist with a background, perhaps in a slightly more comfortable, safe area I can definitely see significant value in having challenge and having research thinking from the arts, humanities, and social science being brought in closer to my own area. And I think there is a significant value in that. So, what I think is the problem that we are facing at the moment is interdisciplinarity.
DANIEL HOOK: Everybody pays lip service to interdisciplinarity and talks about how we want more interdisciplinary research or multidisciplinary research. But then when you get to the evaluation stage, no one evaluates it properly. Nobody knows where to send the paper. I recently wrote a paper, which appeared in PLOS ONE-- thank you, Anne-- on perception prestige and page rank. So it was looking at ranking algorithms in the context of the genealogy of academic networks and how esteem flows on academic networks.
DANIEL HOOK: And that's quite an interdisciplinary thing there's some quite heavy physics in it. There's some reasonable amount of maths. There's also-- I was collaborating with a colleague who's a social anthropologist at Oxford. And so, getting that paper reviewed took a year because actually finding the right people to review it was a complete nightmare. And I think we increasingly have the tools to go through these different areas and actually connect things which are only peripherally connected.
DANIEL HOOK: But then we lack the understanding how to use those tools or how to take on those areas as human beings. So, I think there are really big challenges in that space. So I guess to answer your question more directly, if the funding holds up, I can see a massive diversification of what people specialize in and the types of approach that we take. In some sense, I think that every researcher, regardless of their discipline, if they go to do a PhD, they should have a data science component.
DANIEL HOOK: Arts, humanities, social sciences, everyone should do a basic data science component. And if you take that to it's, kind of, infinite limit, then you're thinking more about-- some of you may know of the Melbourne Degree-- it's quite famous in Australia-- where they take you in to do a degree and you don't specialize until much later in your curriculum. You have a general understanding as an undergraduate and then you specialize in graduate studies.
DANIEL HOOK: And I wonder, actually, if we aren't moving to a world in which graduate studies start to cover the same model where, in fact, you do a PhD, but you do quite a general PhD in order to equip you to understand how to do research. And then you specialize beyond the PhD into a more specific area. So I can see some kind of stratification like that coming up where, in fact, maybe you need to do PhDs or there's a thing above a PhD which takes you more directly into a field.
DANIEL HOOK: So I think that may be-- just thinking generally about the area-- that may be an approach which we see emerge over the next few years. It will replace all those postdocs. [LAUGHTER] Well, thank you very much. [APPLAUSE]