top of page

On-Demand Webinars

DCL Learning Series

Publishing at the Speed of Science: A Lunchtime Conversation With Bill Kasdorf

Upcoming Webinars / On-Demand Webinars

Webinar Transcript

 

Marianne Calilhanna (00:05):

Hello, and welcome to the Data Conversion Laboratory learning series. Today, we are thrilled to host a lunchtime conversation with publishing luminaries, Mark Gross, and Bill Kasdorf. My name is Marianne Calilhanna. I'm the vice president of marketing at Data Conversion Laboratory. And I will be your moderator today. Before I turn it over to Mark and Bill, I do want to encourage everyone to submit any questions or comments via the Q and a box. We are recording this webinar and it will be available in the on demand webinar section of the DCL website. Before we begin begin, I would like to provide a short introduction on Data Conversion Laboratory or DCL is we are also known. Our mission is to structure the world's content content can unlock new opportunities for innovation and monetization. When it has the foundation of rich structure and metadata for content to be easily discovered and used across various platforms, it should be converted into an XML format from which machines can extract pertinent information, DCL services and solutions are all about converting structuring and enriching content and data. We are one of the leading providers of XML conversion services did a conversion structured product, labeling conversion and S 1000 deconversion. Many people are well aware of our excellent content conversion and transformation services, but we also do a lot of work in the other areas listed on this slide, entity extraction, third party validation of previously converted content semantic and metadata enrichment data harvesting, or web scraping content, reuse analysis, and structured content delivery to industry platforms. If you have complex content and data challenges, we can help.

Marianne Calilhanna (02:15):

Now, before we begin, I would like to launch a quick poll just so we understand what your role is in the industry. So I'm going to invite you to simply answer the best you can from the, from the options listed below. What is your role in the industry?

Marianne Calilhanna (02:42):

And I'll give just a minute for that. And while you're submitting your answer, I do want to let everyone know that Bill and Mark have not scripted their chat, because we really wanted to have a natural conversation about publishing about publishing not only during a pandemic, but also during a time of great change for our industry. All right, I'm going to close the poll and just share the results. So everyone knows who we have with us today. So almost half of you are from the scholarly publishing industry. Looks like we have a, some educational publishers, legal trade publishers and some service providers. So welcome everyone. Right. It really feels funny for me to introduce two gentlemen that truly need no introduction. Mark Gross is president and founder of DCL. He established Data Conversion Laboratory 39 years ago. And as an industry leader in structuring content and digitization practices, Bill [inaudible] is principal of store and associates and a founding partner with publishing technology partners. He's an expert on accessibility, XML, HTML, [inaudible] modeling and specification standards, alignment and workflow. Thank you both for joining me for lunch, Mark. I'm going to turn it over to you.

Mark Gross (04:29):

Okay. Thank you, Marianne. I'm really excited about this and I'm really it's really wonderful to, welcome Bill to our, to our webinar series. I, I and it sounds scripted all the ins and outs of course of that. I'll I want to start by Bill. We actually haven't seen each other in a long time. It's the last few days. So each other on video, which is interesting because you know, the big able to be on these video conference means you can see people you don't see otherwise for, except for conferences every once in a while. But tell me, Bill tell 12 of what you're up to these days and you know, what's, what's, what's going on in your life. Sure. Thanks. Nice to be here with you, Mark. We've known each other for a long time. I will I can't resist mentioning that I did a book many, many years ago called the Columbia guide to digital publishing, guess who I got to write the chapter on data conversion markers. So we go way back. So in terms of what I'm doing recently as Marianne mentioned you know, most of my consulting work centers on things like editorial and production workflows.

Bill Kasdorf  (05:58):

And content modeling, metadata all those kinds of geeky things. I was talking to my PTP colleagues the other day. And as we got an inquiry on a publisher that wants our help, and one of my colleagues mentioned, boy, it maybe sounds like they need, maybe want to get into the weeds. And I said, well, that's good. That's what I do. I do weeds. Right. But the other thing I always say I do is I always used to say and accessibility, and I've tried to stop saying that because accessibility is really part of all of that. So any of those engagements, I always factor accessibility. And then one other comment I might make Mark that I think might be relevant is that, you know, why do a client, why have clients come to me? What are they looking for? Fundamentally? I think the main driver is that they realized that they're kind of stuck in an old print centric workflow, a book centric, workflow, and journal article centric workflow. And they realized that they really need to be thinking digital first. And that may sound obvious now, and we've been talking about this for years, but frankly it isn't obvious or because, you know, editors, production people, et cetera, is really hard for them to envision their content in a different way than this kind of chapter based or article based format. So, so anyway, so that's what I do. It's very fun. I'm a busy guy,

Mark Gross (07:36):

I guess one, I, you know, we sort of said when you said you know, the 700 pound gorilla today is covert and that's what everybody's talking about. And we agreed that, that wasn't going to be the centerpiece of our conversation, but it's probably the backdrop to a bunch of things that are going out of here. So I think just in terms of publishing and, and the, the title of this webinar publishing at a speed of science, one of the things that's going on right now, and COVID is excessive baited. It is this whole idea of getting science out as soon as, you know, anything preprints even earlier than preprints first concept thoughts. And it's become, you know, in the last few months it's been like, how can we don't know anymore? How come well, I just looked at it yesterday. So how is that affecting? How is that affecting what's going on? What are you saying?

Bill Kasdorf  (08:32):

Well, yeah, that's, you know, you're absolutely right. And basically, I think one thing that's a notable about that is that, you know, the, particularly the journal publishing process hasn't suddenly changed. So, you know, preprints have been around since archive, you know, the Los Alamos was originally at Los Alamos for particle science, right. Particle scientists. So they've been depending on preprints for decades, really, but, you know, particle science to most people, that's not the real world, right? That's not, that's not where I live, but now it's all the science is doing this and has been for awhile, but suddenly the public is paying attention to the damn preprints. And, you know, I read a comment recently, sorry about my language there, but you know, preprints are for professionals they're designed for your professional colleagues, so they know what you're working on and they can comment on it, et cetera.

Bill Kasdorf  (09:37):

And this is a very good development, but the big problem is the public doesn't understand how to interpret those, those preprints. And doesn't realize that that the results have not been vetted and verified like peer reviewed published article. So that's a, that's the, he's making the point that, you know, the, the, the current journal publishing workflow hasn't actually changed. Although it's accelerating, I've got, what am I? Well, actually a former client that I've done some work for Mac that currently is so new England journal of medicine. And I, current client is annals of internal medicine. You can just imagine what those people are going through right now. They're just unbelievably, simultaneously. They've got a fire hose of content coming at them, and they've got incredible pressure to get stuff out right now. Annals, you know, I was just blown away. When I first started working with Annals, they have a database of 25,000 peer reviewers.

 

Bill Kasdorf  (10:39):

And the process they use to vet an article is incredible how many layers of analysis and how many steps they go through. And at the end of it, it just is a kind of provisionally accepted manuscript. And they also have a staff of statisticians actual in house statisticians. I think they've got seven people that validate the statistics and every article. So, I mean, this is really, really thoroughly vetted content. How do you do that in two days when you've got a pressure to get something up, right. It's crazy. And they've got hundreds and hundreds of things coming in every day

 

Mark Gross (11:17):

And yeah, and also, I mean, a few of our clients we've, we've helped them identify covered related articles that they had in their bags. I mean, this is this coronavirus is not a new concept. It's been around for four years. It's just this level of virus. So so we help them identify it and we set so they can go right online with them. This is, could never have done it 10, 15 years ago because you didn't have the right metadata on you. It all depends on these new publishing techniques and being able to you're ready for that. So I, you know how soon can an article really get out there? It used to be years now it's been condensed, but there's still things that need to happen. I mean, can you really speed up publishing that much?

Bill Kasdorf  (12:08):

Well, I think it actually is getting sped up, you know, this, this coronavirus Covance situation has just made something that was kind of about to happen, have to happen. Right. And so you know, I read recently that I can't quote the name of the of the initiative, but it's a group of basically scientific experts in a variety of fields that are being assembled to be able to provide a kind of, I don't want to say instant or immediate, but incredibly rapid vetting of papers for the news industry so that the news industry can, can say, well, here's this pre-print that came up instead of just jumping to some conclusion about the pre-print run it by these folks and, you know, in a day or two, they're going to get back to you about, you know, what can you say about that?

 

Bill Kasdorf  (13:09):

Or what can't you have to, it's a pretty cool thing. Isn't, that's not, that's, that's still the full traditional peer review then presumably that article will still go through. It's not a substitute for it, but I think, I think that's really interesting w one of the other, you know, you may know that I'm involved in lots of different standards organizations and the one that most people probably on this call haven't heard of is IPTC, which is the international press telecommunications council. It's all the big news agencies and news organizations around the world. And, you know, they're dealing with this problem from the, from the other side of that, the screen is, as you, as you might say that you know, they're getting diluted with this stuff and they've got to sort out, you know, what can they say, that's responsible and they still need to get the news out. Right.

 

Mark Gross (13:56):

Right. Right. And, and, you know, it's, it's a, it almost sounds like what you're describing here is like taking crowdsourcing to a much more serious level. I mean, it's,

Bill Kasdorf  (14:08):

It's a validated crowd, right.

Mark Gross (14:11):

It's a validated crowd, but, you know, we've seen that even unvalidated crowds can get very accurate. But I think it's gotten gamed a lot, but I remember when they did some analysis of what was going on Wikipedia, they found a lot of accuracy based on just editors who were almost self-selecting. But but but now it's gotten a little game, there are organizations like going out there and doing all kinds of stuff, but, but you know, the, the the communications capability of God here, and in fact, you've got everybody on computers, everybody on broadband, you've got, you know, Billions people out there. I means that even if you select carefully, you still have millions of people out there. So that'd be really could change things a lot, I would guess.

Bill Kasdorf  (15:00):

Yeah. You bet.

Mark Gross (15:02):

So

Bill Kasdorf  (15:04):

Not one thing it's services, think of all the millions of people who don't have access, you know, all the people in the digital deserts and the, you know, the global South that, you know, it's, it's not as easy as it is for you and me. Right.

 

Mark Gross (15:19):

So how big is that global desert? I mean, we're where we've we hear about the, you know, the, the people I follow, it's like a huge percentage of the world population. What, how, how big is that? Those are, do you have any sense of that?

 

Bill Kasdorf  (15:36):

I think the number is in the Billions. I can't quote the number I've mentioned. Some of our attendees could probably write chapter and verse on that, but it's having the device, it's having the broadband access that enables you to get the content. And frankly, and other thing that both you and I are passionately committed to is standards. And underneath all of this is standards that actually makes it work. Right. So that's exactly what I've been giving people. Also a COVID related thing is, you know, everybody's on video conferences. This one happens to be go to meeting, but everybody's on zoom or they're on teams or they're on blah, blah, blah. And I was at a teams meeting the other day with a client. And for some reason teams, I, you know, I have teams installed in my computer, but it didn't come up and I can just click the button said, we'll use your browser and we're fine.

Bill Kasdorf  (16:31):

You know, why did that work fine? It's because there's a standard called a web RTC, which is a web realtime communication. And it's it's what enables that to just work without you even being aware of it, you don't have to know which browser you're using. And, you know, people actually grade these standards. What thousands of people are working, girls are working on these standards, but you know, who really drove the web RTC working group and is driving that standard, Microsoft, Apple, Google Mozilla, and the browsers fierce competitors. Right. But they recognize that it's in their interest. If this doesn't work on all of them, it's not going to work. So they kind of have to put their competitive impulses aside and sit down at the table and work together to get this to work. And that's why it works for us. And we don't even have to know about it anymore. Right. It just works.

Mark Gross (17:34):

And I'm glad it also, there are sort of Stan was in the human engineering part of it because like the other day I was on five different technologies and five different meetings in the course of six hours and just keeping track of, and I seem, be able to figure out how to turn on my camera and each one of those, which, because they're doing things in similar ways, that's a good thing. Cause nobody has manuals. I mean, was this nothing to read? So if it doesn't work the way you think it would, you'd never get anything working. You're not talking about accessible accessibility. And you know, a large part of the world does have access, but just in terms of accessibility, we usually think of as people who have disabilities or people who are, and I know you've been actively working on that, it's been in the news lately because for people who are hard of hearing people with masks, people wearing masks has been difficult. So it just came off within those reasonably. But certainly it's been a bigger issue for, for many years all than that on what's going on.

Bill Kasdorf  (18:42):

Yep. And of course, what I think is the best news on the accessibility front is that, you know, making, making content accessible to people with particular disabilities is still a challenge. And you do sometimes need to do a specific kind of remediation of a, of an asset for a particular user. But having said that it used to involve special technologies and special XML formats and special products basically. And accessibility standards have really all converged around web accessibility. Wikag is the foundation for web accessibility. It's the foundation for accessibility specifications around the world. So, you know, us as a section five Oh eight refreshed a couple of years ago to align it with what KC and the EU requirements align with wikag et cetera. And so, you know, now to get a website accessible or an ebook accessible, it's just getting your inch to Mel. Right. And you know, it's, yes. You maybe have to add a few things like some Aria markup, et cetera, but I shouldn't be getting into the weeds. I told you I was a weeds guy. Right. But well the weeds are oftentimes the only missing pieces image descriptions. That's, that's still a challenge for people.

 

Mark Gross (20:12):

Right. Right. I mean, there are things that you need to do

 

Bill Kasdorf  (20:15):

Tools you're using and the technologies you're using in the market you're using you know, not only do you probably already know that, but the vendors that are creating your products know how to do those things. And that's just a huge benefit.

 

Mark Gross (20:29):

Right. But it's also on the thing of best practices. We said image descriptions need to be the root cause of, of these nones not cited. They won't be able to see the image. They need to some description of it. But also things that people forget about is a colorblindness is much bigger problem than the non sightedness. And people are careless wood, wood, wood, wood, colors of maps and lines and all kinds of things like that. And so with the web, you've got imagery, images, and moving images that may disturb people, certain kinds of disability. So this there's a whole, as you know, this, there's a whole bunch of things going on over there. But

 

Bill Kasdorf  (21:11):

In addition, I shouldn't just want to interject, in addition to colorblind people, low vision people, you know, there's 10 times as many low vision people as blind people. And, you know, they need to enlarge the fonts, which you can do with an IPA, but it, you know, try doing that on a phone with a PDF it's, you know, it doesn't work and dyslexic, you know, you need to substitute the fonts to put a dyslexic font. And, and, you know, I know I'm sounding like an evangelist for HIPA, but that's because I am. But that's why you brought me so much more accessible.

 

Mark Gross (21:44):

You just mentioned something fun, fun

 

Bill Kasdorf  (21:53):

Dyslexic people, for example, have a problem with letters that are actually the same thing, but rotated, right? So a P and a Q a or a, and a D a lowercase D lowercase P a lowercase Q you know, it's really pretty much the same shape in most funds. Well, a dyslexic font makes those three things distinctly different, and it makes a huge difference to a dyslexic person to be able to, to read that they need spacing and adjustments and stuff like that, too. But just being able to, I got a demo of a, of a, of a new technology that's just coming out yesterday and they've got a dyslexic fund built right into their platform. So that's pretty cool.

 

Mark Gross (22:42):

That's, that's amazing. I didn't know that, but it's amazing how something like that. I know people have been designing fonts for generations and fonts have always done. I mean, the museum of modern art has a collection of thousands of fonts and people don't think there's an art form, but you also, it's also some practical implications to something as simple as a font. I mean, we'll look at the ones that we have available. We would load up Microsoft word, but there are many funds that are very specialized. Very, so that, that's, that's a very interesting point. So so yeah, I think the point is that once things go into, into into an XML or into something like that, that, that that separates out the content from how it appears, it makes, it makes it very feasible to make things accessible we're in before.

 

Mark Gross (23:32):

And then it becomes the instruments that are out there and the computers are out there. And the ability, it's just a matter of enlarging and doing things like that, which which, which, you know, will make things lot better over time, as long as people apply those kind of standards. And as again, that's talking a little about you know, you're, you're doing a lot of work in education and K through eight. I mean, so, you know, what, what and certainly at the 700 pound gorilla, again COVID as a, as, as, as board things up, people have to learn at home, all kinds of things like that are going on. And I think what we've discussed, I mean, it's been said the code isn't really changing anything that wouldn't have changed otherwise, but it's, it's it's it's expediting, it's accelerating change, which might have taken 10 or 20 years is now might be happening in a space of six months or a year. I mean, we we've moved our office offsite in the space of two days, had everybody working off site I wouldn't have thought you could do that in, in six months or a year. So we've been doing amazing things just because we have to. But what about education? How's that? What are the changes that are going to be long lasting?

 

Bill Kasdorf  (24:50):

Well, it's still hardest in elementary school. Most of my work in education is in higher ed. And there you know, certainly is a big publishers and lots of medium size and even smaller publishers for years have been trying to move again. I mentioned getting away from their Prince paradigm and moving to a digital paradigm. Well, you know, you can do a lot more teaching in a platform that can be responsive to houses, students progress, and can be interactive et cetera than a paper book, but, you know, people like their paper books. And so it's been hard for publishers to get folks to get, you know, to get off the paper book that they don't want to keep having to make and move on to the final. And there's two sides to the platform. There's a publisher's platform, but there's also providing the content to the learning management system that the school uses.

 

Bill Kasdorf  (25:50):

Right? So again, it's standards that make these things work, you know, because there's a bunch of different LMSs out there. So that's your facilitating that, because now it's like, you know, you need to, you need to do it online, but I've got a client right now that, you know, the reason that they're doing this is that they've been publishing their credentialing organization, but it's, you know, they have a curriculum it's educational and it's in fact very complex. And they publish a giant amount of content every year in these giant print books that, you know, these people have to study in order to pass the exams to get their credentials. And what they're finding is that, because now they've also got that content online is that people actually prefer to use the online cause it's, it's, it, it actually teaches better than the but then the print. And so

 

Mark Gross (26:46):

Right now that's been the movement and other areas have been going on for years and something has happened very quickly. I think you know, in the history of where we've both been working, I mean, I guess it first happened with legal books and legal publishing were in the nineties, started moving off where and part was because, Oh, exactly. That, because you had all, you have all this material that needs to be updated all the time and, and the particular big law firms, they don't want to spend money on all that real estate because

 

Bill Kasdorf  (27:18):

Also a giant amount of content, right. To be able to access all of it, kind of all at once.

 

Mark Gross (27:25):

So right. And now it's moved down and moved to a scholarly publishing. And the medical publishing is also because there's just huge amounts of that. And I, it sounds like education is, is the next frontier over here. I'm still wondering why it hasn't happened faster. And is that and, and, you know, you, you made a point that publishers want, want to do that, but it is part of it, their mix of it, is it hard to figure out how to monetize that? I mean, that seems to be an issue and people go online. So many people think everything should be free. Is that something that you've been seeing

 

Bill Kasdorf  (28:05):

An issue? That's actually a, quite a challenge for commercial educational publishers, because there's, what's called OER, open educational resources out there. So, you know, I was talking to one of my clients just a couple of days ago and he's at a major university and they basically ask all faculty when they get their curriculum together in their, you know, their, the reading list for a course, they have to justify now, if, if if the student has to spend more than $25 to get what they're designing, they have to justify that like what the university is mandating that on all it, and you would know, you would know the university. It's a really big one. And they're not alone in doing that. So, wow.

 

Mark Gross (28:57):

So what does it do to the economics of the industry and what, what does the publisher, I guess, why not, why not that $25 book or why not those free resources?

 

Bill Kasdorf  (29:09):

Well, yeah, it's all free resources. You know, you mentioned that, you know, these things come in waves and it hits scholarly publishing first. So, you know, you'll remember when open-access first came along and people just freaked out, you know, about open access and justifiably. So, you know, commercial publishers, society publishers, particularly because they were dependent on the revenue for their, from their journals to help keep the society alive. But you know, now it's mainstream and it's taken for granted. And, you know, I have to say, I, I tend to be a, you know, a positive oriented person. Let me just say, but I'm not, I'm not cynical, but I'm about to say something cynical because frankly it was when the giant publishers discovered that they could just get paid upfront for doing the work. And then they made it open and free. They thought, well, okay, that works for us.

 

Bill Kasdorf  (30:03):

Now I'm oversimplifying dramatically here. And we've still got a huge uproar with Plan S and, you know, different kinds of commercial deals that are a lot of experimentation and, and turmoil is going on in that area. But at least, you know, they're kind of expectation that, you know, what this content needs to be open and free. And then the funders are kind of pushing to make that happen is completely restructuring the whole commercial realm in, in, in STM and journal publishing, certainly it's problem for humanities and social sciences, but education hasn't figured how I figured out how to deal with that yet. So it's, it's a huge problem for education,

 

Mark Gross (30:48):

Right? I mean, I think you're, you've got a point, the very important point you're making over here, that they figured out a whole new, a whole new sales model. And it's actually might be better than the other one. Getting paid up front is not a bad thing. It's an, and, but even though you're not getting that long-term income on an ongoing basis, but those materials are not the kind of materials that would have long term necessarily, I mean, there's a long tail on that.

 

Bill Kasdorf  (31:20):

And that mostly works in the sciences because, you know, they have the funding to pay those article processing charges upfront, but it's a huge problem for humanities scholars, for example, that, you know, they spend, they spend 10 years working on a book and they're not gonna make any money on the book and they're supposed to pay thousands of dollars to get that book published, just so that it can be Open Access? So how that gets funded is still being worked out, right. That's not going to happen, it'll happen, but that problem hasn't been solved yet.

 

Mark Gross (31:53):

Right? I mean, well, I mean the same issue came up with, with libraries and and eBooks at libraries of how to, how to public hers monetize that. And so the, the model now sounds a little kludgy to me, but apparently they they get a certain publishers license, an ebook for a certain number of of, of uses, which is modeled and how many times they think a print book would be used. So I think work is good for 20 readings that the ebook is sold, but 2020 loans, which is, sounds a little kludgy, but it's really, I think that's gotta be a temporary kind of thing. Somebody's, that's a case where

 

Bill Kasdorf  (32:40):

The print model is still driving that because, you know, in print, the model is for bestseller a given library, depending on how big the library is, might buy 10 copies of that book. Are there mind if I shift you copies of that book? Right. Because they can only give it to one person at a time. But you know, you, you brought up earlier, you know, like get ahead of impact this whole COVID year is having on everything. Here we go again, right. Is that, you know, there were, there were major trade publishers that had policies that the librarians were just flipping out about because they were so restrictive. And suddenly, you know, kind of the, the publishers were looking really bad by saying, you know, no, you can't have access to this content. So a lot of publishers kind of eased up on those kind of draconian requirements.

 

Bill Kasdorf  (33:36):

So the library's delight, but, but on the other hand, there's a financial impact on the publisher, right? I mean, books were being loaned out. The only books that were being leveled out of libraries were eBooks for the last four months. Right. So the same kind of thing is going on with movies. No, no movies at movie theaters, a suddenly everything is being downloaded to the streaming services where before you wouldn't do that for two months or whatever, till to the things that came available. Yup. I see Marianne. I wonder if she's got a question for us.

 

Marianne Calilhanna (34:11):

I have a number of questions for both of you. Yeah. Now, I'm just going to interject for a moment here. But with the impact of the pandemic on events, many publishers, including those outside of scholarly publishing are turning back to their publications to try to find revenue. So how could using XML help publishers to create new products and try to find new revenue streams? Wondering if you could speak to that for a moment,

 

Bill Kasdorf  (34:46):

You want to take that Mark, or you want me to go ahead first? You start. Yeah. Well, you know, another major trend, I mentioned thinking digital first, instead of thinking print first in terms of how you, how you create your content, how you manage your content, how you deliver your content. But another aspect of that is people are consuming content in more granular bits, right? They aren't necessarily getting the large publication chunks that they used to. So, you know, if you've got XML and particularly if you've got some semantics associated with that XML so that, you know, what, in all this XML that I've got here is about X. You know, you can go into that repository of content, me to just characterize it that way and say, you know what? I've got all these, let me just give you a mundane example is I published a whole bunch of cookbooks, right? So I've got Italian cookbooks. I got French cookbooks and I've got Mexican cookbooks, et cetera. And it's like, well, you know what? I can put together a bread cookbook because everybody's baking bread all of a sudden, right. So I could look at all that XML I got and say, give me all the recipes that are about bread. And now I got a new book to sell, right. So that's just one example.

 

Mark Gross (36:06):

Right? And, and, and I think well, first of all, the recipe example reminds me. You could do more than that. I have a, I had a client who took all these that they pick all their, their cookbooks, turned them into XML and were able to sell a version of, with, it would go to a Nestle chocolate cookbook. They would go to Nestle and have all Nestle products in there. Same thing can go to a competitor that had their products and they could do all those kinds of things with them. But also, I mean, I think you make a very good point. I mean, as people are, as the market widens there's a much less long tail. There are people who would never have looked at certain books because it wasn't available. They didn't know where to find them. And w and now they could find more, the, a very esoteric book might not have a much larger audience than they've ever had before, but it has to be delivered in different ways.

 

Mark Gross (37:00):

It has to be delivered with. So somebody could instantly get an under phone. And the only way you can do that is if it's an XML. So it can be we form. And we organize and taking them little bits as you go along. And, you know, going back to, what's been going on the last four, four a month, four months. I mean, those publishers who have been very digital, I think you've been able to take advantage of that even now, and be able to that if there weren't ready for that, we've never been able to do. It's just that it wasn't time to set it up. So you know, those are thought ahead, I think are doing well with that. I don't know if well is the right word. It's still difficult times for money, but certainly

 

Bill Kasdorf  (37:44):

Yeah, they, they they're, they're in a transition. Transitions are always hard, but, you know, you can come out better on the other end. You know, one of the ironic things is that books don't knit don't need to go out of print because they don't use print.

 

Mark Gross (38:00):

Right. Ever they're never out of print anymore. Right. We've got an inventory of books. Go ahead. Sorry.

 

Marianne Calilhanna (38:11):

Yeah, Mark. I have a couple of questions for you. So was asked to has DCL been busier since the pandemic and tacked on to that, what services are DCL delivering due to COVID-19 that they did not see before and were not needed before? And of course, I think that to the second, third week of March that

 

Mark Gross (38:36):

Weekend, and maybe you could share some of that story. Right. So I, you know, I think we're very lucky to be in an industry that's all digital. And so in many have not been that lucky we've been able to pretty much keep our regular business going. And many of our customers are all digital. I mean, they're getting content and data from all kinds of places and, and, and that's continued. And, and even, you know, I was worried about like the conferences business, and I actually were going to talk about, I was going to ask you about that, where that's going, but but conference proceedings seem to have still been going on. People have made material available. So even though the conferences having physically taken place they the still the, the, the proceedings have really been done and, and distributed. And so those are things have been going on in general.

 

Mark Gross (39:35):

I mean, specifically what covert, I mean, I mentioned two of our clients have asked us to go and take all their COVID related papers so they can issue them right away. And some of those were done over a weekend. I made with became like, okay, we gotta get roll out by Monday and it'd be, cause they were digital. And they were able to gather, they were able to find them and we're able to publish things very, very, very quickly. So I, you know, it's, it's this didn't happen by accident. I mean, for women terms of DCLS for, I mean, I think we've been, I didn't know we were preparing for this, but for the last two, three years, we've been really upgrading, been upgrading phone lines and upgrading equipment so that you can actually start working off of it. We never knew what that would be for never in my wildest dreams that anybody think of this I think this is truly a black Swan, right.

 

Bill Kasdorf  (40:41):

You know, you were mentioning the conference business and there's a, an interesting parallel to what we were talking about with education. And this is actually a point you made to me in our conversation. So I shouldn't be taking words out of your mouth, but, you know, it's, it's, that is being physically present the best way to consume a presentation or not. And it's like, you know, I think I'm a firm believer that like conference business will come back, but in a different way, because I think that, you know, the, the, the network dating the hallway conversations, you know, the ability to meet new people. It's not just seeing old friends, it's meeting new people. It's, you know, being, you know, I, I, people, people often say that I, I seem to know a lot about stuff, right? How do I, how did I learn all that stuff? It was going to conferences all these years, right? I just paid attention to what people were doing, but you don't actually have to be physically present to consume the presentation. You have to be physically present to interact with that person. Right. And the same thing's happening in education. I'll let you take that because that's actually a point you made to meet the other.

 

Mark Gross (41:55):

Yeah. The point I was making this that maybe education is being done all backwards. And and and actually some elementary schools have gone to a model where the class presentation by the teacher is done over video and done in the evening while the homework is being done in the classroom, which is totally back with water. We've been thinking about, but this way the classroom is done under supervision so that a child can get instruction where he's having problems. And he can be listening, listening to a presentation can be done on video. So that really turns things on its head, but it's, it's, it's, it sounds like a great model. It seems to be working well. And the same thing might be happening in conferences. I think Bill, you made the point that there are conferences where the materials are sent in advance and the conferences, the intention is to be networking and talking to those people.

 

Bill Kasdorf  (42:48):

Yeah, that's right. That's what they have. They have a semiannual confab that gets lots of people together. And they basically just recorded all the presentations that would have been made at that conference and said, here, watch these in advance. And then we're going to have two, two hour conversations as Webb Collins and no agenda, all Q and a. And it was fantastic. I think, you know, certainly the largest zoom call I've been on, I think they had 400 and some people on a zoom call. It was, it was amazing, but it really worked.

 

Mark Gross (43:27):

How do you coordinate with 400 people on asking questions? It's like, I don't know how you do that.

 

Bill Kasdorf  (43:34):

You know, this, this is the W3C. And that group also uses what's called IRC, which is internet chat for almost all of its meetings. So basically you just basically in effect, put your hand up in the internet chat and you get in a queue and then they are watching that may answer the questions or with zoom, you can, you know, you can raise your hand. Okay.

 

Mark Gross (43:58):

Right. So

 

Marianne Calilhanna (44:00):

You had someone who folks in with a question like I'm going. So just to shift a little bit could you speak a little bit about multichannel dissemination in particular workflows that are digital first and print second share, share some knowledge on that.

 

Bill Kasdorf  (44:24):

Yeah, yeah, exactly. And then frankly, that's probably why people are feeling the pressure to make kind of paradigm change in their workflows to be digital centric because they realize, well, you know what, wasn't long ago that the EPUBs came out weeks and weeks after the book was after books book was published. And now the expectation is, well, you should have it at the same time. And by the way that he pub ought to be accessible, there shouldn't be a an accessible version that has to be made later. It's like, no, it should be accessible in the first place. But in terms of workflows the tools are basically there already, right? I mean, we have the, the models, we have the standards they're always in, in evolution. So I wouldn't want to give anybody the impression that the job is done. Right. But it's become mainstream to, to work in a digital way as Mark was saying.

 

Bill Kasdorf  (45:25):

And because of that, if you get the, if you get the right kind of modeling in your content, so you're not thinking of chapters, you're thinking of topics. And you can, I think Maryanne, you were asking when you are talking about multichannel, you're asking for a given publication that creating various publications, but at the same publication being published in a variety of ways. The fact that so many of the standards are really XML and HTML based really helps or yeah, scholarly publishing it's Jansen bits, XML completely it's. The link goes to the lingua franca of scholarly publishing, but I was just working with a client, a current client that I have where, you know, I basically pointed out to them that everything in there, jazz XML, this is a major medical journal, everything, everything in the Jeff's XML model, the tag library provides the ability to tag everything you need for accessibility. And then they need a specification. A big believer in a model is not a spec, right? In other words, they're still need a specifications for how are you going to use Jan's XML? But if you do that right, then you've got everything you need in the jazz XML to just transform it to the HTML that you need for your website or that you need for your eBooks. You're in pubs,

 

Bill Kasdorf  (47:03):

Well, someone available in jazz, but you have to have the policies and procedures and whatever in place that you're putting all the things in and not taking shortcuts around that. Yeah, exactly. I get so many, you know, so many people still say, well, wait a minute, I'm using jazz. Why is my jets XML for add upon math? The same as my jazz XML for silver chair, it's like, well, those are specifications, right? They don't, they don't add, ask for it to be done in exactly the same way. So you still have to work with who's the recipient of this XML and what do they need you to do so that their systems work properly with it.

 

Mark Gross (47:43):

Right. Right. And if I get, I mean, we're talking about jazz for scholarly publishing, but there is other forms of XML that I used in other specific purposes for various reasons. So like a Dita is, is used not for scholarly publishing, what's used in technical publications, but you want to be able to modularize things. So the, the, the, the, the, what, you know, I, I, you know, when I first saw XML washer, when I first saw SGML, I said, wow, this is a language for really describing what texts looks like. And that's what XML it, what XML is, what you can define the purposes for it, depending on what you needed for. So there are many forms of being used out there. Nice thing about standards is there are so many of them, right. But but you know, in any event it allows you to be able to communicate between the various people that are out there to make sure you're doing things in the same way, and that it works the same way.

 

Bill Kasdorf  (48:44):

Yeah. And that it's optimized for a particular purpose. So, you know, jackets and it's counterpart BIS and have for years and years been optimized to represent scholarly content. And it's, it's, there is no better way to represent scholarly content than Jackson bits. But guess what, when you're looking at a website, you're not looking at Jansen BIS, you're either looking at PDF or you're looking at [inaudible], but there was always Jetson BIS, or I'll give you an accessibility anecdote. I was in a working group. This is, goes back a few years with so many of these working groups are just really, really technical people. And this guy was just a total expert tech geek. Right. And he said, he just kind of posted it to the list. Why do we need to be paying attention to math ML? Nobody uses mathematical. And I went, wait, no, he's thinking of he pumps in trade publishing and they never seem FML. And even when there are equations, the math, the mill isn't in the IBA, because it doesn't work. Right. So the image of the equation is in there. And he's saying, why do we even need to pay attention to this? I said, you have no idea that millions of equations are being created in mass and mill all the time. And any method, any math equation that you see in a scholarly journal or a scholarly book, I can almost guarantee you was that was math ML at some point in the workflow. Right.

 

Mark Gross (50:21):

Right. Or, and, and I think that goes through for any specialized kind of area. I mean, I, you know, I don't know what happens to like music publishing, but it's probably some version of that that you'd use in music publishing and invoiced music languages. That's true. So, you know, so if you've lived with, look at music, you'd say, why would anybody ever need that? But I think it goes through an any specialized field. And I think the beauty of XML has been is it allows that ability to be able to apply the specialized, the additions that you need in order to be able to work in many different areas.

 

Bill Kasdorf  (50:58):

Yeah. And it's really, it's really in layers. Right. So you've got that underlying layer of XML ma you know, a lot of people are shocked when I remind them that we are still using XML 1.0, that is the governing XML spec right now is XML 1.0, right. So it's just radically useful. Right. But then it gets specified for data, for topic based offering and chats for scholarly journals, et cetera, et cetera. But then there's a layer on top of that that says, well, this is Jasper that is going to wire jets. That's going to Silverchair with Jess is going to add upon, but then you've also got the publisher is saying, but here's what I need to do with this content. Right. So they've got their own specification on top of that. But so it's, you know, it's, it's like layers, but it's not, you throw this one out and do this one. Instead they build on each other.

 

Mark Gross (51:58):

That's right. And it's moved, moved up or over the years has really been pretty interesting. So, so Marianne, any other questions you've got over there? It's

 

Bill Kasdorf  (52:14):

Because we only got five minutes left before we have to wrap up.

 

Marianne Calilhanna (52:17):

Hmm. Well I will share one thing. You, you do have someone who disagrees with you about the use of math ML in that IEEE processes and displays millions of equations without any math. And they'll change that for you if you want. IEEE, but

 

Bill Kasdorf  (52:40):

I have several very good friends that IEEE. So I'm going to dig into that because I would be surprised if, if the equations aren't method of streaming that worked well, I used to be working with, I shouldn't mention this cause that's a competitor of DCL, but I was with a conversion in prepress house for, for many years. And every equation done by that firm is done in mathematics, the publisher isn't asking for it. And the publisher, usually wasn't getting the MathML, but the vendor was using MathML to create the equations in the first place. So

 

Mark Gross (53:15):

Sure though. But no, I, I think there are, there are groups that are using tech LaTech is really

 

Bill Kasdorf  (53:20):

That's right. Oh yeah. Particularly I think the way I've ventured in particularly cause in engineering, checking is dominant, so yeah. That's right,

 

Mark Gross (53:28):

Right. And there's going to the methadone, you really doing very fine print. So it's really been a yang yang kind of thing over the years when we got specialized over there, that's one thing that happens over there, but that's probably true in other places also, but the general trend in places has been to try to use our XML because it is, it's a language that lets you define lots of things. I mean, there's also other areas also is an area that you may not be able to do everything that's there, but I think there's a large group and certainly as much groups that use math ML until they get to those really, you know, page long equations, which may be difficult to do.

 

Bill Kasdorf  (54:22):

Yeah. And I take that correction. So thanks to the IEEE people that said that, but you're right. What I should say is that the math is math ML or tech tech slash logic, very likely it's one or the other of those formats.

 

Mark Gross (54:35):

But

 

Bill Kasdorf  (54:38):

I can't resist pointing out that for accessibility. You want to map them out. So even if you're creating your content in

 

Bill Kasdorf  (54:48):

So word, for example you know, you can put tech equations in a word document or our mathematic, which is in the word document, but when that document gets made accessible, you actually need to convert that tech into math ML to produce the accessible

 

Mark Gross (55:06):

Context.

 

Bill Kasdorf  (55:11):

Yeah. MathML, they probably are going to get MathML pretty soon because they're going to be under pressure for those equations to be accessible.

Mark Gross (55:19):

Right. I think this development needs to be done. I think just the visualization of mathematics, it's taken a long time to catch up when you get to really fine. Fine.

 

Bill Kasdorf  (55:29):

Yeah. Actually there's some very significant work being done on meth ML right now to make it easier to use and to enter, make it easier to kind of embed it in, in workflow and say, it's a mass email refresh that's being done. Right. Coming out with kind of a simple, it was simplified subset that's what's mostly needed. And then, you know, kind of the more extensive version that can basically express anything you want to express in math, right?

 

Marianne Calilhanna (56:24):

We're coming to the top of the hour. And I think people are finishing up their lunch. I wanted to thank you both. But one of our participants, I thought shared an elegant and lovely comment to both of you. And that is "thanks to the work that Bill Mark has done in the industry. We would never have been able to prove produce articles in 24 hours. That is now a requirement for COVID-19 articles and content." So kudos to both of you for your work on XML and conversion workflow. And I think that's a lovely way to end and I want to thank everyone. Who's taken time out of their day to join us for this webinar. And I'd like to remind everyone that the DCL learning series comprises webinars, a monthly newsletter blog articles. So we hope that you take advantage of this. You can access all this information from our website at data conversion, laboratory.com. This now concludes today's webinar. I hope everyone has a great day. Thank you.

transcript-icon.png
bottom of page