xls-reader's Introduction

XLSParser is a reasonably fast PHP library intended to parse the legacy Microsoft Excel binary XLS format. It was written because all existing PHP implementations were too slow. The code is a more or less direct port of Python's excellent xlrd package. The library is very early alpha. I made it a long time ago.

Feedback

Issues and pull requests are accepted.

Requirements

  • PHP 5.4.0 or newer.
  • Multibyte string extension (mbstring) to handle UTF-16LE encoding, used in newer Excel files.
  • A little-endian system, because of the way PHP's pack/unpack are used. If you're not on SPARC, you should be covered.
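
If in doubt about the last requirement, byte order can be checked directly from PHP. This is a minimal sketch and not part of the library; the function name isLittleEndian is made up for illustration. It relies only on the documented pack() format codes 'S' (16-bit, machine byte order) and 'v' (16-bit, always little endian).

```php
<?php
// Hypothetical helper, not part of XLSParser.
function isLittleEndian()
{
    // 'S' packs an unsigned 16-bit integer in machine byte order,
    // 'v' packs the same integer as little endian regardless of machine.
    // The two byte strings are identical only on a little-endian system.
    return pack('S', 1) === pack('v', 1);
}

var_dump(isLittleEndian());
```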

Install

Via command line: composer require laacz/xls-parser.

Tests

Install dependencies with composer install (development dependencies are installed by default; the older --dev flag is deprecated), then run the tests with vendor/bin/phpunit.

Usage

KISS. Pass the contents of the file to Book and the workbook gets loaded and parsed.

$book = new laacz\XLSParser\Book(file_get_contents('workbook.xls'));

Accessing sheets

Sheets can be accessed via their numeric index or their name. Since the Sheet object implements ArrayAccess and IteratorAggregate, array-style indexing and foreach iteration work as well.

$sheet = $book[0];
$sheet = $book['Page1'];
$sheet = $book['Vājprāts'];

Accessing cells

Cells can be accessed the same way as sheets. Indexes start from zero.

$row = $sheet[0];
$cell = $sheet[$sheet->nrows - 1][1];

To get the value of a cell, cast it to string (or use it in a string context) or read its value property:

$val1 = $cell->value;
$val2 = (string)$cell;

Or, if you wish...

$val1 = $book[0][0][0]->value;

Formatting

For now, formatting can only be accessed raw. In short, a sheet contains a mapping array rich_text_runlist_map[][], whose entries are arrays of two elements: a position and a font reference. The first is the position from which the style applies; the second is an index into the book's font_list[], which in turn contains the format description.
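
To make the run list idea concrete, here is a rough sketch that splits a cell's text into segments, each paired with a font index. It does not touch the library at all: the function splitRichText is hypothetical, and it assumes that each run is array(position, font index), that positions are character offsets, and that runs are sorted by position. Text before the first run gets null, meaning "whatever the cell's default font is".

```php
<?php
// Hypothetical helper, not part of XLSParser.
// $runs is assumed to look like one entry of rich_text_runlist_map:
// a list of array(position, fontIndex), sorted by position.
function splitRichText($text, array $runs)
{
    $segments = array();
    $length = mb_strlen($text, 'UTF-8');
    $start = 0;
    $font = null; // null: no run applies yet, default font assumed

    foreach ($runs as $run) {
        list($position, $fontIndex) = $run;
        if ($position > $start) {
            // Close the segment that ends where this run begins.
            $segments[] = array(
                mb_substr($text, $start, $position - $start, 'UTF-8'),
                $font,
            );
        }
        $start = $position;
        $font = $fontIndex;
    }

    if ($start < $length) {
        // Remaining text belongs to the last run (or to the default font).
        $segments[] = array(
            mb_substr($text, $start, $length - $start, 'UTF-8'),
            $font,
        );
    }

    return $segments;
}

// Made-up data: font 5 from position 0, font 7 from position 6.
$segments = splitRichText('Hello world', array(array(0, 5), array(6, 7)));
print_r($segments);
```

Each font index in the result would then be looked up in the book's font_list[] to get the actual format description.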

Excel dates

The library does its best to parse dates found within cells. It returns a string in a common date format: 'yyyy-mm-dd hh:mi:ss'. For example: 2014-12-31 12:59:59.
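
For background, Excel stores dates as serial numbers: days since a base date, with the time of day as the fractional part. The sketch below is not the library's code, just an illustration of the arithmetic under stated assumptions: the 1900 date system, serials from March 1900 onwards, and no time zone handling. The function name excelSerialToString is made up; 25569 is the serial number of 1970-01-01.

```php
<?php
// Hypothetical helper, not part of XLSParser.
function excelSerialToString($serial)
{
    // Shift the serial so that day 0 is the Unix epoch (serial 25569),
    // then convert days to seconds. round() absorbs floating-point noise
    // in the fractional (time of day) part.
    $timestamp = (int) round(($serial - 25569) * 86400);

    // gmdate() formats in UTC, so the local time zone does not shift the result.
    return gmdate('Y-m-d H:i:s', $timestamp);
}

echo excelSerialToString(42004.5), "\n"; // 2014-12-31 12:00:00
```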

Wishlist

  • Memory efficiency does not exist in context of this library.
  • Performance might be better.
  • Abstract formatting.
  • Add helper methods for common tasks - returning columns, ranges, etc.
