How to remove non-UTF-8 characters from a text file
Tags: Linux, Bash, Text, UTF-8, Character Encoding
Problem Overview
I have a number of Arabic, English, and Russian files, all encoded in UTF-8. When I try to process these files with a Perl script, I get this error:
Malformed UTF-8 character (fatal)
Manually checking the content of these files, I found some strange characters in them. Now I'm looking for a way to automatically remove these characters from the files.
Is there any way to do it?
Linux Solutions
Solution 1 - Linux
This command:
iconv -f utf-8 -t utf-8 -c file.txt
will clean up your UTF-8 file, skipping all invalid characters.
-f is the source encoding
-t is the target encoding
-c skips any invalid byte sequence
The cleaned text is written to standard output, so redirect it into a new file rather than overwriting the original.
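As a minimal sketch of the effect of -c (sample.txt is a hypothetical file created here purely for demonstration):

```shell
# Create a sample file with an invalid UTF-8 byte (0xFF) in the middle
printf 'hello \xffworld\n' > sample.txt

# -c silently drops the invalid sequence; the result goes to stdout,
# so redirect it into a new file rather than overwriting the original
iconv -f utf-8 -t utf-8 -c sample.txt > sample.clean.txt

cat sample.clean.txt
# prints: hello world
```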
Solution 2 - Linux
iconv can do it. For example, if the strange bytes are actually Windows-1252 (cp1252):
iconv -f cp1252 foo.txt
With no -t option, iconv converts to the encoding of the current locale.
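To make the target encoding explicit, the same conversion can be written with -t utf-8. A short sketch (legacy.txt is a hypothetical example file created here for illustration):

```shell
# 0x93 and 0x94 are curly quotes in Windows-1252 but invalid as UTF-8
printf 'say \x93hi\x94\n' > legacy.txt

# Convert from cp1252 to UTF-8 explicitly and write to a new file
iconv -f cp1252 -t utf-8 legacy.txt > legacy-utf8.txt
```

Unlike the -c approach in Solution 1, this reinterprets the bytes rather than discarding them, so it is the better choice when the "strange characters" are really text in another encoding.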
Solution 3 - Linux
Any reliable method must read the file byte by byte and understand how UTF-8 characters are constructed from those bytes. The simplest approach is to use an editor that will read anything but only write valid UTF-8 characters; TextPad is one option.
Solution 4 - Linux
None of the methods here or on any other similar questions worked for me. In the end, what worked was simply opening the file in Sublime Text 2: go to File > Reopen with Encoding > UTF-8, then copy the entire content of the file into a new file and save it.
It may not be the expected solution, but I'm putting this out here in case it helps anyone, since I struggled with this for hours.
Solution 5 - Linux
cat foo.txt | strings -n 8 > bar.txt
will do the job, though note that strings keeps only runs of at least 8 printable characters, so shorter lines are discarded along with the invalid bytes.
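A quick sketch of that lossiness (foo.txt is created here just for illustration):

```shell
# 'short' is under 8 printable characters; the 0xFF byte interrupts the long line
printf 'short\nthis line is long enough \xffto survive\n' > foo.txt

# strings -n 8 emits only printable runs of 8 or more characters
cat foo.txt | strings -n 8 > bar.txt

cat bar.txt
# 'short' is gone; the printable text around the invalid byte survives
```

Because of this, strings is best reserved for files where losing short fragments is acceptable; the iconv-based solutions above preserve more of the original text.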