<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://els-support4.river-valley.com/index.php?action=history&amp;feed=atom&amp;title=Creating_epub_files_from_Elsevier_dataset</id>
	<title>Creating epub files from Elsevier dataset - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://els-support4.river-valley.com/index.php?action=history&amp;feed=atom&amp;title=Creating_epub_files_from_Elsevier_dataset"/>
	<link rel="alternate" type="text/html" href="https://els-support4.river-valley.com/index.php?title=Creating_epub_files_from_Elsevier_dataset&amp;action=history"/>
	<updated>2026-04-16T16:29:22Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.39.5</generator>
	<entry>
		<id>https://els-support4.river-valley.com/index.php?title=Creating_epub_files_from_Elsevier_dataset&amp;diff=579&amp;oldid=prev</id>
		<title>Rishi: New page: ==&#039;&#039;&#039;What is epubgen?&#039;&#039;&#039;== epubgen is a tool for generating [http://en.wikipedia.org/wiki/EPUB ePub] files from Elsevier&#039;s Journal and Book (Stand-alone and Book-Series) CONTRAST-out datas...</title>
		<link rel="alternate" type="text/html" href="https://els-support4.river-valley.com/index.php?title=Creating_epub_files_from_Elsevier_dataset&amp;diff=579&amp;oldid=prev"/>
		<updated>2009-10-21T07:14:20Z</updated>

		<summary type="html">&lt;p&gt;New page: ==&amp;#039;&amp;#039;&amp;#039;What is epubgen?&amp;#039;&amp;#039;&amp;#039;== epubgen is a tool for generating [http://en.wikipedia.org/wiki/EPUB ePub] files from Elsevier&amp;#039;s Journal and Book (Stand-alone and Book-Series) CONTRAST-out datas...&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;==&amp;#039;&amp;#039;&amp;#039;What is epubgen?&amp;#039;&amp;#039;&amp;#039;==&lt;br /&gt;
epubgen is a tool for generating [http://en.wikipedia.org/wiki/EPUB ePub] files from Elsevier&amp;#039;s Journal and Book (Stand-alone and Book-Series) CONTRAST-OUT datasets. It is a [http://en.wikipedia.org/wiki/Command-line_interface command line interface (CLI)] tool that runs on both Linux and Windows. The input is an Elsevier CONTRAST-OUT dataset and the output is an ePub file. A dataset is an archive (a container of files) that may contain the PDF, XML, stripins (GIF images of MathML code), figures and, most importantly, a dataset.xml file. dataset.xml is the entry point to the dataset; it may list details of the files in the dataset, including their path names.&lt;br /&gt;
&lt;br /&gt;
This program is issued under the GNU General Public License.&lt;br /&gt;
&lt;br /&gt;
===Features===&lt;br /&gt;
# A single dataset may contain a collection of issues. By default, each issue is written to a separate ePub file. If a separate ePub file is needed for each item (i.e., article), you can split the output by item.&lt;br /&gt;
# The XML file contains both MathML code and, as an alternative, stripins (images of the MathML code). You can choose to include MathML, the stripin images, SVG, or all of them.&lt;br /&gt;
# You can select the depth of the table of contents using a command line switch. The default is 3.&lt;br /&gt;
# A parser is included in this package to check the ePub file generated.&lt;br /&gt;
# This tool will work both in Linux and Windows Platform. &lt;br /&gt;
&lt;br /&gt;
===Software dependencies===&lt;br /&gt;
# Perl 5.8.8 or higher&lt;br /&gt;
# xsltproc for Linux. For Windows, this program is included in the archive. It is required to convert XML files to ePub format using XSLT stylesheets.&lt;br /&gt;
# zip program for Linux. For Windows, this program is included in the archive. An ePub file is a zip file with the extension &amp;lt;code&amp;gt;.epub&amp;lt;/code&amp;gt;.&lt;br /&gt;
# Java 1.6 or higher for parsing.&lt;br /&gt;
# Cygwin is required for Windows.&lt;br /&gt;
&lt;br /&gt;
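Because an ePub file is a zip archive, its contents can be inspected with any zip library. The following is a minimal Python sketch, not part of epubgen: the function names are hypothetical, and the check that the first entry is named mimetype and contains application/epub+zip follows the EPUB container convention rather than anything stated on this page.&lt;br /&gt;

```python
import io
import zipfile


def inspect_epub(source):
    """Return (is_zip, entry_names, mimetype_ok) for an epub path or file object."""
    if not zipfile.is_zipfile(source):
        return False, [], False
    # is_zipfile may move the read position of a file object, so rewind it.
    if hasattr(source, "seek"):
        source.seek(0)
    with zipfile.ZipFile(source) as archive:
        names = archive.namelist()
        # The EPUB container format expects "mimetype" as the first entry.
        mimetype_ok = (
            bool(names)
            and names[0] == "mimetype"
            and archive.read("mimetype").strip() == b"application/epub+zip"
        )
    return True, names, mimetype_ok


def build_sample_epub():
    """Build a tiny in-memory archive for demonstration (not a complete epub)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as archive:
        archive.writestr("mimetype", "application/epub+zip",
                         compress_type=zipfile.ZIP_STORED)
        archive.writestr("META-INF/container.xml", "placeholder")
    buffer.seek(0)
    return buffer


if __name__ == "__main__":
    print(inspect_epub(build_sample_epub()))
```

To inspect a generated file, pass its path to inspect_epub instead of the in-memory sample.&lt;br /&gt;
&lt;br /&gt;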
===Installation===&lt;br /&gt;
To install, unzip the archive. All files are extracted to a folder named epubgen-*.*, referred to below as the installation path. &amp;lt;code&amp;gt;*.*&amp;lt;/code&amp;gt; stands for the version number, which appears in the name of the folder created when you unzip the archive. This folder contains &amp;lt;code&amp;gt;epubgen.pl&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;epubgen.sh&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;epubgen.bat&amp;lt;/code&amp;gt;, &amp;lt;code&amp;gt;epubgen.conf&amp;lt;/code&amp;gt;, etc. Adding the installation path to the &amp;lt;code&amp;gt;PATH&amp;lt;/code&amp;gt; system variable lets you run the tool from any folder.&lt;br /&gt;
&lt;br /&gt;
First, edit the &amp;lt;code&amp;gt;epub.conf&amp;lt;/code&amp;gt; file to set the path to &amp;lt;code&amp;gt;Java&amp;lt;/code&amp;gt;. This setting in &amp;lt;code&amp;gt;epub.conf&amp;lt;/code&amp;gt; is required even if the Java path is already set correctly in your system variables. Separate variables are provided for Linux and Windows: on Linux, set the Linux variable and comment out the Windows one, and vice versa on Windows.&lt;br /&gt;
&lt;br /&gt;
===How to create epub files?===&lt;br /&gt;
Unzip the CONTRAST-OUT dataset to any folder and use the following syntax:&lt;br /&gt;
&lt;br /&gt;
;Linux:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;font color=&amp;quot;green&amp;quot;&amp;gt;Syntax&amp;lt;/font&amp;gt;&lt;br /&gt;
 epubgen.sh -d [Options] [name of folder where dataset.xml resides] -p [folder where epub file should be generated]&lt;br /&gt;
 &lt;br /&gt;
&amp;lt;font color=&amp;quot;green&amp;quot;&amp;gt;Example&amp;lt;/font&amp;gt;&lt;br /&gt;
 &lt;br /&gt;
 epubgen.sh -d /opt/my-dataset/EWX00001158/ -p /opt/my-epub-files/&lt;br /&gt;
&lt;br /&gt;
;Windows:&lt;br /&gt;
The command differs between the Cygwin terminal and the default Windows command prompt. In the syntax and example below, the first line is for the Cygwin terminal and the second is for the default command prompt:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;font color=&amp;quot;green&amp;quot;&amp;gt;Syntax&amp;lt;/font&amp;gt;&lt;br /&gt;
 epubgen.sh -d [Options] [name of folder where dataset.xml resides] -p [folder where epub file should be generated]&lt;br /&gt;
 epubgen.pl -d [Options] [name of folder where dataset.xml resides] -p [folder where epub file should be generated]&lt;br /&gt;
&lt;br /&gt;
&amp;lt;font color=&amp;quot;green&amp;quot;&amp;gt;Example&amp;lt;/font&amp;gt;  &lt;br /&gt;
 epubgen.sh -d e:/my-dataset/EWX00001158/  -p  e:/my-epub-files/&lt;br /&gt;
 epubgen.pl -d e:\my-dataset\EWX00001158  -p  e:\my-epub-files\&lt;br /&gt;
&lt;br /&gt;
====List of options====&lt;br /&gt;
 --valid or -v         Validate the epub file produced. Default is to validate&lt;br /&gt;
 &lt;br /&gt;
 --path or -p          Path to which epub files are to be copied&lt;br /&gt;
 &lt;br /&gt;
 --nostripins or -S    Do not include stripins. Default is to include stripins&lt;br /&gt;
 &lt;br /&gt;
 --svg or -g           [yes|no] Whether to insert SVG code for math&lt;br /&gt;
                           &lt;br /&gt;
 --mathml or -m        [yes|no] Whether to retain MathML code.&lt;br /&gt;
                       Default is no. However, either stripins or MathML is required&lt;br /&gt;
 &lt;br /&gt;
 --item or -i          Split the output into one epub file per item. Default is a single epub file per issue&lt;br /&gt;
 &lt;br /&gt;
 --level or -l         Level of table of contents to be included in the toc.ncx file of epub. Default is 3&lt;br /&gt;
&lt;br /&gt;
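The options above can be combined on one command line. The following is a minimal Python sketch of assembling such a command, not part of epubgen: the function name and its parameters are hypothetical, and it assumes that options may follow the -p argument and that --level and --mathml take their value as the next argument, neither of which is confirmed on this page.&lt;br /&gt;

```python
import shlex


def build_epubgen_command(dataset_dir, output_dir, script="epubgen.sh",
                          split_items=False, toc_level=None, mathml=None):
    """Assemble an argument list based on the documented switches.

    Assumption: options are accepted after the -d and -p arguments, and
    value-taking options read their value from the following argument.
    """
    command = [script, "-d", dataset_dir, "-p", output_dir]
    if split_items:
        # One epub file per item instead of one per issue.
        command.append("--item")
    if toc_level is not None:
        # Depth of the table of contents in toc.ncx (documented default: 3).
        command += ["--level", str(toc_level)]
    if mathml is not None:
        # Documented values are yes and no.
        command += ["--mathml", "yes" if mathml else "no"]
    return command


if __name__ == "__main__":
    command = build_epubgen_command(
        "/opt/my-dataset/EWX00001158/",
        "/opt/my-epub-files/",
        split_items=True,
        toc_level=2,
    )
    # Print the command for review; it could be run with subprocess.run(command).
    print(shlex.join(command))
```

Building the command as a list keeps folder names with spaces intact when the list is passed to a process launcher.&lt;br /&gt;
&lt;br /&gt;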
===Project home page===&lt;br /&gt;
This project is hosted on gna.org. The URL is:&lt;br /&gt;
&lt;br /&gt;
 [https://gna.org/projects/epubgen https://gna.org/projects/epubgen]&lt;br /&gt;
&lt;br /&gt;
For information on how to access the source code repository, visit the following URL:&lt;br /&gt;
&lt;br /&gt;
 [https://gna.org/svn/?group=epubgen https://gna.org/svn/?group=epubgen]&lt;/div&gt;</summary>
		<author><name>Rishi</name></author>
	</entry>
</feed>