Download **class.pdf2text.php** from [https://pastebin.com/dvwySU1a][1] or [http://www.phpclasses.org/browse/file/31030.html][2] (registration required).

Code:

    include('class.pdf2text.php');
    $a = new PDF2Text();
    $a->setFilename('filename.pdf');
    $a->decodePDF();
    echo $a->output();


----------


 - `class.pdf2text.php` [Project Home][3]

 - `pdf2textclass` doesn't work with all the PDFs I've tested. If it doesn't work for you, try [PDF Parser][4].


----------

    


  [1]: http://pastebin.com/dvwySU1a
  [2]: http://www.phpclasses.org/browse/file/31030.html
  [3]: https://webcheatsheet.com/php/reading_clean_text_from_pdf.php
  [4]: https://www.pdfparser.org/demo

Is there a way in Python to serialize a dictionary that is using a tuple as key?

e.g.

    a = {(1, 2): 'a'}

Simply using `json.dumps(a)` raises this error:

```none
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python2.6/json/__init__.py", line 230, in dumps
    return _default_encoder.encode(obj)
  File "/usr/lib/python2.6/json/encoder.py", line 367, in encode
    chunks = list(self.iterencode(o))
  File "/usr/lib/python2.6/json/encoder.py", line 309, in _iterencode
    for chunk in self._iterencode_dict(o, markers):
  File "/usr/lib/python2.6/json/encoder.py", line 268, in _iterencode_dict
    raise TypeError("key {0!r} is not a string".format(key))
TypeError: key (1, 2) is not a string
```
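One possible workaround, offered as a sketch and not as the canonical answer: JSON object keys must be strings, so the dictionary can be stored as a list of `[key, value]` pairs and rebuilt on load. The helper names `dumps_tuple_keys` and `loads_tuple_keys` are made up for this example, the code assumes flat tuples as keys, and it uses a dict comprehension, so it needs Python 2.7 or later.

```python
import json


def dumps_tuple_keys(d):
    """Serialize a dict whose keys are tuples as a JSON list of [key, value] pairs."""
    # Each tuple key becomes a JSON array, which sidesteps the string-only key rule.
    return json.dumps([[list(k), v] for k, v in d.items()])


def loads_tuple_keys(s):
    """Rebuild the dict, turning each key array back into a tuple."""
    return {tuple(k): v for k, v in json.loads(s)}


a = {(1, 2): 'a'}
encoded = dumps_tuple_keys(a)
print(encoded)
print(loads_tuple_keys(encoded) == a)
```

The trade-off is that the resulting JSON is no longer an object, so other consumers of the data have to know about the pair-list convention.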

JSON serialize a dictionary with tuples as key

I'm writing a reusable Django app and I need to ensure that its models are only synced when the app is in test mode. I've tried to use a custom DjangoTestRunner, but I found no examples of how to do that (the documentation only shows how to define a custom test runner).

So, does anybody have an idea of how to do it?

**EDIT**

Here's how I'm doing it:

    # in settings.py
    import sys
    TEST = 'test' in sys.argv

Hope it helps.
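A slightly stricter variant of the `sys.argv` check above can be sketched as follows. It assumes tests are started with `manage.py test`, so it only looks at the subcommand position; the helper name `is_test_run` is hypothetical, and other runners (for example pytest) would need a different check.

```python
import sys


def is_test_run(argv=None):
    """Return True when the command line looks like `manage.py test ...`."""
    argv = sys.argv if argv is None else argv
    # Only inspect the subcommand slot, so an unrelated argument that
    # happens to be the word "test" is not mistaken for a test run.
    return len(argv) > 1 and argv[1] == 'test'


# In settings.py one could then write:
TEST = is_test_run()
```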

Detect django testing mode

How to extract text from the PDF document *using PHP*?

(I can't use other tools, I don't have root access)

I've found some functions that work for plain text, but they don't handle Unicode characters well:

http://www.hashbangcode.com/blog/zend-lucene-and-pdf-documents-part-2-pdf-data-extraction-437.html

How to extract text from the PDF document?


I'm trying to JSON encode some objects in PHP, but I'm facing a problem: I want to encode data that is held in a class's private members.
I found this piece of code, which encodes the object by calling an encode function like:

    public function encodeJSON()
    {
        // Create the container first; assigning to an undefined $json fails.
        $json = new stdClass();
        foreach ($this as $key => $value)
        {
            $json->$key = $value;
        }
        return json_encode($json);
    }

However, this only works if the object I want to encode does not contain other objects, and mine does. How can I encode not only the "outer" object, but also any members that are objects themselves?

PHP json_encode class private members

I have an array:

    $a = array('foo' => 'fooMe');

and I do:

    print_r($a);

which prints:

    Array ( [foo] => fooMe )

Is there a function, so when doing:

    needed_function('    Array ( [foo] => fooMe )');

I will get the array `array('foo' => 'fooMe');` back?

How to create an array from the output of an array printed with print_r?

Can I programmatically get the source code of a function by its name?

Like:

    function blah($a, $b) { return $a*$b; }
    echo getFunctionCode("blah");

Is it possible?

Are there any self-descriptive PHP functions to reconstruct function/class code? (I mean instead of reading the source code directly from the source file.)

In Java there exists: http://java.sun.com/developer/technicalArticles/ALT/Reflection/

Reconstruct / get source code of a PHP function

`stdClass Object ([Sector] => Manufacturing [Date Found] => 2010-05-03 08:15:19)`

So I can access `[Sector]` by using `$object->Sector`, but how can I access `[Date Found]`?

Accessing Class Properties with Spaces

I'm writing a script that is registered as an endpoint for a webhook. I know that it's successfully registered because I'm writing the header of every request to my server logs. Here's a sample:

    Content-Type: text/xml; charset=UTF-8
    User-Agent: Jakarta Commons-HttpClient/3.1
    Host: =={obfuscated}== 
    Content-Length: 1918

The API that I've registered with is POST-ing a JSON object to my script, and I'd like to parse that object using PHP. As you can see from the request header, there's a nice big fat JSON object waiting to be parsed. It seems straightforward, but it hasn't been.

At first I tried using `$_POST['json']` or just `$_POST`, but since the data isn't in an array, I wasn't really sure how to access it like that.

I've tried using `file_get_contents('php://input')` and `fopen('php://input', 'r')` with and without `json_decode()`, but no luck. I can't use `http_get_request_body()` since the server I'm on doesn't have PECL and that's out of my control.

Are there any other ways to interact with the POST-ed JSON object that I'm missing? Thanks!

Issue reading HTTP request body from a JSON POST in PHP

I have a UIWebView displaying a PDF file. It works fine, but how can I enable zooming on the PDF?

Enable zooming/pinch on UIWebView

> **Possible Duplicate:**  
> [PDF Generation Library for Java](https://stackoverflow.com/questions/3986105/pdf-generation-library-for-java)  


I'm working on an invoice program for a local accounting company.
What is a good way to create a PDF file with Java? Is there a good library?
I'm totally new to PDF export (in any language).

Create PDF with Java

Is there a way to get the stock Android browser to auto-open a PDF, Word or other typical file without having to go through the process of downloading the file and then getting the user to open the file from the Downloads app or the Notification bar?

We have a web application that has a lot of documents that we'd like to include and not have to convert to HTML, but making the user download the file and manually open it is not easy to train users on.

On iOS, these files all display inline in the browser. I'd like a way to get the browser to auto-launch the files into Acrobat Reader or QuickOffice or whatever program the user has to display them.

Does anyone know a way to do that?  I know that Google Docs has some PDF viewing support, but people using our web app may not have public Internet access in all cases, and may be hitting on a local web server.

How to display a PDF via Android web browser without "downloading" first

I am trying to draw the contents of a `scrollview` into a `PDF` context and I am facing a problem with `pagination`.

Here is the code I have used:

    - (void)renderTheView:(UIView *)view inPDFContext:(CGContextRef)pdfContext
    {
        // Creating frame.
        CGFloat heightOfPdf = [[[self attributes] objectForKey:SRCPdfHeight] floatValue];
        CGFloat widthOfPdf  = [[[self attributes] objectForKey:SRCPdfWidth] floatValue];
        CGRect pdfFrame = CGRectMake(0, 0, widthOfPdf, heightOfPdf);   
        CGRect viewFrame = [view frame];
    
        if ([view isKindOfClass:[UIScrollView class]])
        {
            viewFrame.size.height = ((UIScrollView *)view).contentSize.height;
            [view setFrame:viewFrame];
        }
        // Calculates number of pages.
        NSUInteger totalNumberOfPages = ceil(viewFrame.size.height/heightOfPdf);
            
        // Start rendering.
        for (NSUInteger pageNumber = 0; pageNumber < totalNumberOfPages; pageNumber++)
        {
           // Starts our first page.
           CGContextBeginPage (pdfContext, &pdfFrame);
           // Turn PDF upsidedown
           CGAffineTransform transform = CGAffineTransformIdentity;
           transform = CGAffineTransformMakeTranslation(0,view.bounds.size.height);
           transform = CGAffineTransformScale(transform, 1.0, -1.0);
           CGContextConcatCTM(pdfContext, transform);
    
           // Calculate amount of y to be displace.
           CGFloat ty = (heightOfPdf*(pageNumber));
                
           CGContextTranslateCTM(pdfContext,0,-ty);
           [view.layer renderInContext:pdfContext];
        		
           // We are done drawing to this page, let's end it.
           CGContextEndPage (pdfContext);
        }
    }

It creates the required number of pages but places the content incorrectly, as the following figure shows. ![enter image description here][1]

Is there anything wrong in my code? 

  [1]: http://i.stack.imgur.com/DGpvp.png


Context Drawing + Pagination

I found this neat command to merge multiple PDFs into one, using Ghostscript:

    gs -dBATCH -dNOPAUSE -q -sDEVICE=pdfwrite -sOutputFile=out.pdf in1.pdf in2.pdf

The resulting size is smaller than the combined size of the 2 PDFs.

Running the command with a single file as input still results in a smaller output file.

Is there an option in Ghostscript to just copy the pages as they are when merging, without doing any compression?

If not, is it possible that the Ghostscript compression is so good that it will result in absolutely no loss in quality? 



Ghostscript to merge PDFs compresses the result

How can I add text from a DIV to a textarea?

I have this now:

    $('.oquote').click(function() {
        $('#replyBox').slideDown('slow', function() {
            // Animation complete: append the quoted text to the textarea.
            var quote = $('.container').text();
            $('#replyBox').val($('#replyBox').val() + quote);
        });
    });

Add text to textarea - jQuery

I need to extract the last line from a number of very large (several hundred megabyte) text files to get certain data. Currently, I am using Python to cycle through all the lines until the file is exhausted and then I process the last line returned, but I am certain there is a more efficient way to do this.

What is the best way to retrieve just the last line of a text file using Python?
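One common approach, sketched here under a few assumptions, is to seek to the end of the file and read backwards in chunks until a newline is found. The function name `last_line` and the default `chunk_size` are illustrative, the file is assumed to be UTF-8 with `\n` line endings, and trailing newlines are ignored.

```python
import os


def last_line(path, chunk_size=4096):
    """Return the last line of a file without reading the whole file."""
    with open(path, 'rb') as f:
        f.seek(0, os.SEEK_END)
        pos = f.tell()
        data = b''
        # Read fixed-size chunks backwards until a newline shows up
        # before the final line (trailing newlines are ignored).
        while pos > 0:
            step = min(chunk_size, pos)
            pos -= step
            f.seek(pos)
            data = f.read(step) + data
            if b'\n' in data.rstrip(b'\n'):
                break
        lines = data.rstrip(b'\n').split(b'\n')
        return lines[-1].decode('utf-8')
```

Only the tail of the file is read, so the cost depends on the length of the last line and not on the size of the file.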

Efficiently finding the last line in a text file

Can anyone tell me if it's possible to find an element based on its content rather than by an **ID** or **class**?

I am attempting to find elements that don't have distinct classes or IDs. (I then need to find that element's parent.)

How can I find elements by text content with jQuery?

I know it sounds easy. I need to center some text, but when the text is too long it needs to wrap onto the next line while still being centered in my layout.

Here's my code:

    <LinearLayout
        android:layout_width="wrap_content"
        android:layout_height="wrap_content"
        android:id="@+id/showdescriptioncontenttitle"
        android:paddingTop="10dp"
        android:paddingBottom="10dp"
        android:layout_centerHorizontal="true"
    >
        <TextView
            android:id="@+id/showdescriptiontitle"
            android:text="Title"
            android:textSize="35dp"
            android:layout_width="wrap_content"
            android:layout_height="wrap_content"
        />
    </LinearLayout>

I added `paddingTop` and `paddingBottom` because I need some space.
PS: My code is bigger; it's in a RelativeLayout.

align text center with android

Good day. 

The one thing I now hate about Haskell is the number of packages for working with strings.

At first I used native Haskell `[Char]` strings, but when I started using Hackage libraries I got completely lost in endless conversions. Every package seems to use a different string implementation, and some adopt their own handmade thing.

Next I rewrote my code with `Data.Text` strings and the `OverloadedStrings` extension. I chose `Text` because it has a wider set of functions, but it seems many projects prefer `ByteString`.
Could someone briefly explain why to use one or the other?

PS: By the way, how do I convert from `Text` to `ByteString`?

> Couldn't match expected type
> *Data.ByteString.Lazy.Internal.ByteString*
>            against inferred type *Text*
>       Expected type: IO Data.ByteString.Lazy.Internal.ByteString
>       Inferred type: IO Text

I tried `encodeUtf8` from `Data.Text.Encoding`, but no luck:

> Couldn't match expected type
> *Data.ByteString.Lazy.Internal.ByteString*
>            against inferred type *Data.ByteString.Internal.ByteString*


**UPD:**

Thanks for the responses. The `*Chunks` functions look like the way to go, but I'm somewhat shocked by the result. My original function looked like this:

    htmlToItems :: Text -> [Item]
    htmlToItems =
        getItems . parseTags . convertFuzzy Discard "CP1251" "UTF8"


And now it has become:

    htmlToItems :: Text -> [Item]
    htmlToItems =
        getItems . parseTags . fromLazyBS . convertFuzzy Discard "CP1251" "UTF8" . toLazyBS
        where
          toLazyBS t = fromChunks [encodeUtf8 t]
          fromLazyBS t = decodeUtf8 $ intercalate "" $ toChunks t


And yes, this function does not work, because it is wrong: if we pass `Text` to it, we are asserting that the text is already properly encoded and ready to use, so converting it is pointless. Still, a conversion this verbose has to take place somewhere outside `htmlToItems`.

Text or Bytestring

I am developing a front end of a web app using NetBeans IDE 7.0.1. Recently I had a very nasty bug, which I finally fixed.

Say I have code

    var element = '<input size="3" id="foo" name="elements[foo][0]" />';
    $('#bar').append(element);

I noticed that something had gone wrong when I saw that the `size` attribute didn't work in Chrome (I didn't check other browsers). When I opened that element in the Inspector, it was interpreted as something like

    <input id="&quot;3&quot;" name="&quot;elements[foo][0]&quot;"
        size="&quot;foo&quot;" />

Which was rather strange. After manually retyping the `element` string character by character, the bug was gone. When I undid that change, I noticed that NetBeans alerted me about some Unicode characters in my old code. They were `\u200b`, zero-width spaces, after each '=', between '][' and at the end of the string. So the string looked normal because zero-width spaces aren't displayed, but after escaping them my string was

    '<input size=\u200b"3" id=\u200b"foo" name=\u200b"elements[foo]\u200b[0]" />\u200b'
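For checking other files for the same problem, a small script can list and remove such invisible characters. This is only a sketch: the `INVISIBLE` table and the function names are invented for the example, and the table covers a handful of common zero-width code points, not every invisible character in Unicode.

```python
# Zero-width code points that commonly sneak into pasted code.
INVISIBLE = {
    '\u200b': 'ZERO WIDTH SPACE',
    '\u200c': 'ZERO WIDTH NON-JOINER',
    '\u200d': 'ZERO WIDTH JOINER',
    '\ufeff': 'ZERO WIDTH NO-BREAK SPACE (BOM)',
}


def find_invisible(text):
    """Return (index, name) for every listed invisible character in text."""
    return [(i, INVISIBLE[ch]) for i, ch in enumerate(text) if ch in INVISIBLE]


def strip_invisible(text):
    """Remove the invisible characters listed in INVISIBLE."""
    return ''.join(ch for ch in text if ch not in INVISIBLE)


element = '<input size=\u200b"3" id=\u200b"foo" />\u200b'
print(find_invisible(element))
print(strip_invisible(element))
```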

Now where the hell did I get them?

I'm not sure where I copied the code of `element` from, but it was definitely one of the following:

 - Another pane of the NetBeans editor with an HTML template file;
 - Google Chrome Inspector, 'Copy as HTML' action;
 - Google Chrome view-source page (very doubtful).

But I can't reproduce the bug with any of these.

I use NetBeans 7.0.1 and Google Chrome 13.0 under Windows 7. No keyboard switchers or anything like that are running. I'm also using Git for version control, but I didn't pull that code, so it is very unlikely that Git is to blame. It can't be a stupid joke by my colleagues, because they are quite well-mannered.

Any suggestions as to what messed up my code?

\u200b (Zero width space) characters in my JS code. Where did they come from?

Does anyone have an idea why this Python 3.2 code

    try:
        raise Exception('X')
    except Exception as e:
        print("Error {0}".format(str(e)))

works without a problem (apart from Unicode encoding in the Windows shell :/),
but this

    try:
        raise Exception('X')
    except Exception as e:
        print("Error {0}".format(str(e, encoding = 'utf-8')))

throws *TypeError: coercing to str: need bytes, bytearray or buffer-like object, Exception found*?

How to convert an Error to a string with custom encoding?

**Edit**

It does not work either if there is a \u2019 in the message:

    try:
        raise Exception(msg)
    except Exception as e:
        b = bytes(str(e), encoding = 'utf-8')
        print("Error {0}".format(str(b, encoding = 'utf-8')))

But why can't `str()` convert an exception to bytes internally?
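As a sketch of what is going on: in Python 3, `str(obj)` converts an object to text, while `str(obj, encoding=...)` decodes a bytes-like object, which is why passing an exception to the second form raises `TypeError`. An encoding only comes into play when the text is turned into bytes. The helper names below are made up for illustration, and printing with `ascii()` is just one way to avoid console encoding errors.

```python
def exception_text(exc):
    """Return the exception message as a text (str) object."""
    # str(obj) converts to text; str(obj, encoding=...) instead *decodes*
    # a bytes-like object, which is why passing an exception there fails.
    return str(exc)


def exception_bytes(exc, encoding='utf-8'):
    """Return the exception message encoded to bytes."""
    return str(exc).encode(encoding)


try:
    raise Exception('It\u2019s broken')
except Exception as e:
    text = exception_text(e)
    raw = exception_bytes(e)
    # Printing ASCII-escaped text avoids console encoding errors.
    print('Error {0}'.format(ascii(text)))
    print(raw)
```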



Converting Exception to a string in Python 3

The following code:

    var text = (new WebClient()).DownloadString("http://export.arxiv.org/api/query?search_query=au:Freidel_L*&start=0&max_results=20");

results in a variable `text` that contains, among many other things, the string

> "$Îº$-Minkowski space, scalar field, and the issue of Lorentz invariance"

However, when I visit that URL in Firefox, I get

> $κ$-Minkowski space, scalar field, and the issue of Lorentz invariance

which is actually correct. I also tried

    var data = (new WebClient()).DownloadData("http://export.arxiv.org/api/query?search_query=au:Freidel_L*&start=0&max_results=20");
    var text = System.Text.UTF8Encoding.Default.GetString(data);

but this gave the same problem.

I'm not sure where the fault lies here. Is the feed lying about being UTF-8 encoded, and the browser is smart enough to figure that out but `WebClient` is not? Is the feed properly UTF-8 encoded, but `WebClient` is failing in some other way? What can I do to mitigate this?
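The mangled string has the shape one gets when UTF-8 bytes are decoded with a single-byte code page. The following Python sketch reproduces the symptom; it is an illustration, not a diagnosis of `WebClient`, and it assumes the feed is valid UTF-8 and that the client decoded it with a Latin-1-like encoding.

```python
# \u03ba is the Greek letter kappa from the feed title.
title = '$\u03ba$-Minkowski space'

# The feed sends UTF-8 bytes on the wire.
wire_bytes = title.encode('utf-8')

# Decoding those bytes with a single-byte code page produces the garbage
# seen in the question: each UTF-8 byte becomes its own character.
mangled = wire_bytes.decode('latin-1')

# Decoding with the encoding that was actually used restores the text.
restored = wire_bytes.decode('utf-8')

print(ascii(mangled))
print(restored == title)
```

If this is what happens, the fix on the client side would be to decode the downloaded bytes explicitly as UTF-8.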

WebClient.DownloadString results in mangled characters due to encoding issues, but the browser is OK

Is there a way in Python to extract the Unicode code point of a single character?

**Edit:** In case it matters, I'm using Python 2.7.
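The built-in `ord()` returns the code point of a one-character string. A minimal sketch, with a hypothetical wrapper `code_point` that adds a length check; the `u''` literals keep it valid on both Python 2.7 and Python 3. On narrow Python 2 builds, characters outside the Basic Multilingual Plane are stored as two code units, so they would not pass the length check.

```python
def code_point(ch):
    """Return the Unicode code point of a single character as an int."""
    if len(ch) != 1:
        raise ValueError('expected exactly one character')
    return ord(ch)


print(code_point(u'a'))
print(hex(code_point(u'\u03ba')))
```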

Get unicode code point of a character using Python

I am compressing JavaScript files and the compressor complains that my files contain the `ï»¿` characters.

How can I search for these characters and remove them? 
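The sequence `ï»¿` is how the three bytes of the UTF-8 byte order mark (`EF BB BF`) look when displayed in a single-byte encoding. One way to find and strip it, sketched in Python under the assumption that the BOM sits at the very start of each file; the function names and the `.js` suffix default are illustrative.

```python
import codecs
import os


def has_utf8_bom(path):
    """Return True if the file starts with the UTF-8 byte order mark."""
    with open(path, 'rb') as f:
        return f.read(len(codecs.BOM_UTF8)) == codecs.BOM_UTF8


def strip_utf8_bom(path):
    """Rewrite the file without its leading UTF-8 BOM; return True if changed."""
    with open(path, 'rb') as f:
        data = f.read()
    if not data.startswith(codecs.BOM_UTF8):
        return False
    with open(path, 'wb') as f:
        f.write(data[len(codecs.BOM_UTF8):])
    return True


def strip_boms_in_tree(root, suffix='.js'):
    """Strip the BOM from every file under root whose name ends with suffix."""
    changed = []
    for dirpath, _dirs, files in os.walk(root):
        for name in files:
            if name.endswith(suffix):
                full = os.path.join(dirpath, name)
                if strip_utf8_bom(full):
                    changed.append(full)
    return changed
```

Calling `strip_boms_in_tree` on a source directory rewrites files in place, so it is worth running on a copy or under version control first.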


| Content Type | Original Author | Original Content on Stackoverflow |
|---|---|---|
| Question | Sfisioza | View Question on Stackoverflow |
| Solution 1 - Php | Pedro Lobito | View Answer on Stackoverflow |
